Completed

PARSING SCRIPT for *.PPTX slides content (Powerpoint feature extraction)

This project was successfully completed by nbohorquez for $200 USD in 8 days.

Get free quotes for a project like this
Employer working
Completed by:
Skills Required
Project Budget
$30 - $250 USD
Completed In
8 days
Total Bids
6
Project Description

Phase 1:
-------
We need to have JSON describing slide structure in basic terms :
1) texts = array of text objects , when the text has multiple lines with different fontsize, this should be treated as subtext object , where their relative position is noted (line 1 : postion 0,0 , second line : 0,1 , third line : 0,2)
2) images = extract file to storage and note path, note crop and position & size of image , preferably in pixels or % of slide width/height
3) notes = extract slide notes into JSON structure - sample structure for sample slide attached separately ... JSON information in this file are more illustrative describing what needs to be done rather than detailing each field


Phase 2:
-------
A) map to layout
the general idea is to extract grid layout from the content of the slide, there is need for
slight tolerance for items - idealy as option of 5-10px , (so text slightly overlapping image is still considered as text within image and therefore treated as in same grid position / label ) , top-left corner is important for assuming which row/colum is this image present

B) Layout 2 - grid
If there are multiple images
! grid should be projected onto image to determine number of columns / rows. Number of columns : determined by maximal number of images+texts in any column Number of rows : determined by maximal number of images+texts in any row
Texts present within bounding box of image and +5-10pixels (ideally variable as option) should be treated as text with same grid position as that image - Top Left of text is determining to which image is this text pinned , if text crosses multiple images , its span should be noted , if text has multiple lines each with different font size, is should be marked as subtext (with position)


MORE DETAILED JOB BRIEF IN ATTACHED PDF!

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online