Extracting table data from PDF
by Wolfgang Schemmel (Perleone) (Hannover.pm, Erlangen.pm)
Extracting table data from PDF is aimed at beginners and is held in English. The talk starts on 2016-03-11 at 11:20 and runs for 20 minutes. It takes place at DATEV.
Transforming a table (for example, a spreadsheet) into a PDF is trivial: usually you just have to print it as PDF. Getting the same data back out of the PDF in a structured, reusable way is quite another story. Using a real-life example (a cafeteria menu), I will try not to end up with a jumbled word salad.
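To illustrate why extraction is harder than printing, here is a minimal sketch that is not taken from the talk. It assumes a PDF text extractor has already produced text fragments with page coordinates, in no particular order; the fragment values, the function name rebuild_rows and the y_tolerance parameter are all hypothetical. Rows are recovered by grouping fragments with similar vertical positions, and cells are ordered by horizontal position.

```python
# Hypothetical input: (x, y, text) fragments as a PDF text extractor might
# report them. PDF coordinates grow upwards, so a larger y is higher on the
# page. All values are made up for illustration.
fragments = [
    (300.0, 700.2, "Price"),
    (50.0, 700.0, "Dish"),
    (50.0, 680.1, "Soup"),
    (300.0, 680.0, "2.50"),
    (300.0, 660.0, "4.80"),
    (50.0, 659.8, "Pasta"),
]


def rebuild_rows(fragments, y_tolerance=2.0):
    """Group fragments into rows by similar y, then order cells by x."""
    rows = []  # each entry: (y of the first fragment in the row, [(x, text), ...])
    # Walk the fragments from the top of the page to the bottom.
    for x, y, text in sorted(fragments, key=lambda f: -f[1]):
        if rows and abs(rows[-1][0] - y) <= y_tolerance:
            rows[-1][1].append((x, text))   # same row as the previous fragment
        else:
            rows.append((y, [(x, text)]))   # start a new row
    # Within each row, sort the cells from left to right and keep only the text.
    return [[text for _, text in sorted(cells)] for _, cells in rows]


table = rebuild_rows(fragments)
for row in table:
    print(row)
```

Real documents add complications this sketch ignores, such as cells that span several lines, merged cells and columns that are only implied by whitespace.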
Slides: http://perleone.github.io/talks/extracting.table.data.from.pdf.gpw2016.pdf
Tags: extraction pdf table