Support unavailable
Please try again later

Fast Data Mining with PyTables and pandas

by Yves Hilpisch for EuroPython 2012

In a number of industries, like financial services or utilities, it is important to analyze huge sets of data in an efficient and fast manner. Typical solutions that are based on SQL databases or follow some kind of data warehouse approach are generally quite expensive and demand for huge computing power.

Using the Python libraries PyTables and pandas brings high performance data mining to your desktop computer or even notebook. The talk illustrates how to beneficially apply these libraries in the context of financial time series and other data sets. It is also illustrated how you can implement fast calculations on data sets which do not fit into the memory of your computer.

In addition, the talk provides a number of examples for the visualization of your data mining efforts.



  1. Gravatar
    no audio??
  2. Gravatar
    I checked and the audio works well?!
  3. Gravatar
    only left channel at the beginning, and audio doesn't really start until 1:15
  4. Gravatar
    Yves, I attempted the simulation. The object "m" was used in the range() function. I got an error message say range() is expecting an end param in the type INT. So I added one statement prior to the for loop m = int(m). It worked. The simulation objection S, can be printed. However, when I attempted the option pricing, the calculation of VO did not result in any error but there was no result returned. Would you suggest what is wrong with my work? I was using a ipython notebook under Enthought EPD/Canopy.
  5. Gravatar
    Would like to help. Just send me or post the code/IPYNB.
  6. Gravatar
    In Video: There are subtitles for part 2?
    Spanish translation exists for the two videos?
    Thank you.

New comment

60 minutes (inc Q&A)

Tagged as

ERP numpy numeric case-study database
Our Sponsors
Python Experts
SSL Matrix
Wanna sponsor?