any gems to process PowerPoint files?

Andy5 · November 23, 2010, 3:31am

Does anyone know of any gems or plugins that can take a PowerPoint and create images out of every slide and also access the text in each slide?

DBL_Systems · November 23, 2010, 2:00pm

not sure if this will help but google docs just started providing 3rd party publishing/conversion from MS Office to google docs. perhaps this can get you part of the way by converting and using the gdata gem

b

Gero_Zoltan · November 26, 2010, 7:15pm

Hi,

I highly reccomend you Prezi http://prezi.com/ and not any MS product, since they are not free, and not open-source, so if any company makes anything which can use it - it still mean that you or some will pay for it in the end. And mostly PowerPoint is really out of date. cheers Zoltán

Andy5 · November 29, 2010, 3:49pm

My client has PowerPoint files and we need to process them into images and text.

I was hoping there was something out there that could do this directly.

If anyone else has suggestions, please post.

Thanks

walterdavis · November 29, 2010, 4:55pm

My client has PowerPoint files and we need to process them into images and text.

I was hoping there was something out there that could do this directly.

If anyone else has suggestions, please post.

Thanks

Not on the server, but PowerPoint itself can output a Web site from each file using File / Export. Something for a temp at your client's office to do all day long.

Walter

Andy5 · November 30, 2010, 10:26pm

Unfortunately that's not going to work either.

I really need to process these either in my Rails app or through a 3rd party service.

Anyone know of any SaaS or web services that would do this?

Thanks

11155 · November 30, 2010, 11:08pm

Andy wrote in post #965257:

Unfortunately that's not going to work either.

I really need to process these either in my Rails app or through a 3rd party service.

Anyone know of any SaaS or web services that would do this?

Thanks

How about the Google Docs API?

Best,

Vladimir_Rybas · December 1, 2010, 6:38am

Let's say the uploaded PPT is belong to Document model, and it's pages images are belong to DocumentPage model.

So you need to make Paperclip Processor, which you use in Document model. Inside this Processor you need to: 1. Create tmp folders where you will perform all operations 2. Convert PPT to PDF using PyODConverter • Webbygram 3. Convert PDF to TIFF images using ImageMagick. 4. Process TIFF images with Tesseract(http://code.google.com/p/tesseract-ocr/) to extract keywords. 5. Convert TIFF to PNG 6. Create DocumentPage models passing PNG images and extracted keywords as a parameters. 7. If all DocumentPage models are created, just go out of Processor to let the Document model be created.

Here is the Processor https://gist.github.com/723079 It's kinda messy and kinda belongs to my application, but you get the idea.

Topic		Replies	Views
display ppt file in rails application rubyonrails-talk	4	313	October 10, 2014
How to convert Image To Text in RoR rubyonrails-talk	4	439	June 14, 2011
How to Parse Microsoft Word Document rubyonrails-talk	8	265	March 17, 2011
Converting Word Documents (and other types of files) to PDF in a rails application rubyonrails-talk	4	139	January 15, 2009
How to get key and values dynamically from the pptx file or text file rubyonrails-talk	0	313	November 3, 2020

any gems to process PowerPoint files?

Related topics

More Resources