Support unavailable
Please try again later

Writing a parallel and distributed tool for backing up a multi-terabyte data

by Marco Nenciarini for EuroPython 2011

How Python was used to implement a tool for the daily backup of an over 50 terabyte distributed data warehouse based on Greenplum Database technology.

The talk covers our experiences, from the initial customer requirements to the first alpha release of the tool, describing the main issues we faced and the Python modules we adopted to address them, while using agile methods.

Though the application is not currently available as an open-source project, the talk gives valuable insights into the development of distributed and parallel applications using Python.

Video

Do you have some questions on this talk? Leave a comment to the speaker!

New comment


Language
EN
Duration
45 minutes (inc Q&A)
Our Sponsors
Spotify
Python Experts
SSL Matrix
Wanna sponsor?