|Parse a PDF file to CSV, and then import to MySQL
- A base folder structure named XXX (configurable) has a list of subfolders with several hundrends of pdfs
_ A process runs to convert all pdfs to UNICODE UTF-8 text format by using xpdf and rename all processed pdfs to *.pdf_YYYYMMDD_BATCH_SERIALNO
All text files are moved to a mirror folder structure [url removed, login to view]
all processed pdf are moved to an archi...
||PHP, Perl, Python, Data Processing, MySQL
||Feb 19, 2015
||Feb 19, 2015Ended
|Convert TXT to HDF5 with Python 2 and H5PY,CSV
||Need to convert tab-delimited text files to HDF5 format using Python 2.7 (and H5PY/CSV packages). A sample text file is attached ([url removed, login to view]) along with an image of how the correct [url removed, login to view] should appear. Preference will be given to programmers who can produce clear, well-commented code.
||Jun 9, 2014
||Jun 9, 2014Ended
|Parsing HTML files into well-formatted CSV file
||I need a script to convert files generated by a newspaper database from the HTML format into comma-separated-values text files. A **java** code is preferred, but a **python** script is also acceptable.
The input file for the script contains newspaper articles and some tags (e.g., date, newspaper name, article title, etc.) that are not explicitly formatted, but can be parsed based on the HTML ...
||Java, Python, Script Install, Project Management, Engineering, Software Architecture, Software Testing, Shell Script
||Nov 22, 2011
||Nov 22, 2011Ended
|Python Script to Convert Text File to CSV
||I have a text file that I need converted into a CSV using Python.
The format currently is this:
Any line that does not parse properly should simply be dumped.
The "/"'s are always as described with a / before and afte...
||Feb 13, 2011
||Feb 13, 2011Ended