Project Management Systems

Manipulating Common Data Files in Python

Copyright 2010 by Stephen Vermeulen
Last updated: 2010 Oct 23
The Python Programming Language

27 api arch archives area aspx big binary cellspacing city cloud contain creating directly doughellmann editing edition exec formats head https introduced language last left limited longer native particular pil right script src tbody toolkit types ubuntu under updated via walk weblog within working write writing wxpython
Often times it is useful to be able to read or write various common data file formats from within Python.
  • 2010-Oct-23: The PyMOTW looks at the re regular expressions module. [9416] [1]
  • 2010-May-31: Displaysing EXIF information using wxPython. [9153]
  • 2009-Jul-17: How to use the Python zipfile module. Another article showing how to use zipfile. [3256] [1] [2]
  • 2008-Nov-20: xlutils is a package for working with Excel files. [7240] [1]
  • 2008-Sep-17: This article about Creating Excel spread sheets from Python programs discusses the simple approach of just writing a CSV formatted ASCII file as well as a more sophisticated approach using the pyExcelerator module to create a full XLS file. One could also use the win32com package to create XLS files by Office Automation techniques, but this requires that Excel is installed on the machine you run the Python script on. There is some more information on using pyExcelerator here. The xlwt module (a fork of the pyExcelerator project) can be user to create XLS spreadsheet files without requiring Excel being present. [5097] [1]
  • 2008-Sep-04: pexif is a Python library for parsing and editing EXIF data in JPEG files. This is also available here. [497]
  • 2008-Jul-31: PyFFI (also here and on SourceForge) is a library for reading, writing and processing binary files - the file format is described with XML. The API docs start with a straight forward example that illustrates the power of this module. There is also a file editor called QSkope to simplify editing file formats that are supported by PyFFI. [504]
  • 2008-Feb-28: OOoPy is a module for modifying OpenOffice documents from with Python. [5182]
  • 2007-Nov-28: A simple illustration of how to use Python's zipfile module to unpack a zip file. Note the sizes of files this can extract is limited by available memory. [4205]
  • 2007-Nov-02: PyMedia, a Python module for the manipulation of wav, mp3, ogg, avi, divx, dvd, cdda... files. Here is a guide on how to install pymedia on Ubuntu's Gutsy Gibbon release. [494]
  • 2007-Sep-26: PalmDB, a pure Python library to read/write/modify Palm PDB and PRC format files. [2252]
  • 2007-Sep-04: The csv module, introduced in Python 2.3, is useful for working with various types of data table text files, such as the comma separated variable (csv) format. [1629] [1]
  • 2007-Aug-24: The Big Picture is a Python library for reading and writing EXIF and IPTC metadata in JPEG and TIFF files [503]
  • 2007-Aug-24: dbfpy, a Python module for reading and writing DBF files (as used by bBase, Foxpro, Clipper...) [502]
  • 2007-Aug-24: PythonTidy, cleans up and reformats Python script files. [501]
  • 2007-Aug-24: using os.walk() to traverse a directory tree to perform operations on particular types of files. [500]
  • 2007-Aug-24: pickfile, (PickFile home page) a module for managing files that contain pickled object [499]
  • 2007-Aug-24: Some comments on reading EXIF data from Python. There is also some support for this in the Python Imaging Library (PIL). [498]
  • 2007-Aug-24:, a module to parse, read and write JPEG EXIF and COM metadata [496]
  • 2007-Aug-24: pyUnRAR, a module for unpacking RAR archives [495]
  • 2007-Aug-24: IPTCInfo, allows you to extract and modify the IPTC metadata area in images [493]
  • 2007-Aug-24: Stani's DXF Library, is a Python library that you can use to computer generate DXF format CAD files. [492]
  • 2007-Aug-24: libextractor, a module to extract data from files of any type [491]
  • 2007-Aug-24: Extracting data from Garmin GPS devices using Python, via the PyGarmin toolkit. [490] [1]
  • 2007-Aug-24: vobject, a module for reading vCard and vCalendar files [489]
  • 2007-Aug-24: Manipulation of ID3 tags in MP3 files from within Python, the module.  eyeD3 is another module. pytagger is another which might be the only one that is not under the GPL license. id3reader is no longer maintained. mutagen is another, this is a pure Python library. [488]
  • 2007-Aug-24: How to turn a .pyc file into a code object for exec() [487]
  • 2007-Aug-23: pyExcelerator: is a module for generating Excel97 files directly from Python [128] [1]

              back to home