parse text and extract tables from a pdf — parse_nppdf • llm4np

Parse the text of a PDF and extract its tables.

Usage

parse_nppdf(pdfpath)

Arguments

pdfpath: character(1). Path to a PDF; may be a URL. The PDF is expected to be a neuropsychology evaluation report.

Note

tabulapdf can throw errors that do not register in R, and they do not seem to be trappable. If such a message appears while parsing, review it and be sure you are comfortable with the event it reports before relying on the result.
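As a rough illustration of what can still be trapped, the sketch below wraps a parser in `tryCatch()`. It only catches errors that are signaled as R conditions. `safe_parse()` and `failing_parser()` are hypothetical names introduced here for illustration and are not part of llm4np.

```r
# Hypothetical helper (not part of llm4np): call a parser and record any
# error that is signaled as an R condition instead of stopping the session.
safe_parse <- function(path, parser) {
  tryCatch(
    list(result = parser(path), error = NULL),
    error = function(e) list(result = NULL, error = conditionMessage(e))
  )
}

# Stand-in parser used only to demonstrate the wrapper; it fails on purpose.
failing_parser <- function(path) stop("could not read ", path)

out <- safe_parse("missing.pdf", failing_parser)
is.null(out$result)  # TRUE: no result was produced
out$error            # "could not read missing.pdf"
```

With llm4np installed, the same wrapper could be called as `safe_parse(deidpath, llm4np::parse_nppdf)`. Any tabulapdf message that does not register in R would still be printed and would leave `error` as `NULL`, so the printed output still needs to be read.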

The original URLs for the PDFs included with the package can be found using `llm4np:::sqspsrc` and `llm4np:::pecssrc`.
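A minimal sketch for inspecting the bundled PDFs and their recorded sources follows. It assumes the PDFs live in the `pdfs` folder of the installed package, as the example path on this page suggests; the variable names `pdfdir` and `bundled` are illustrative.

```r
# Directory of PDFs shipped with the package; "" if llm4np is not installed.
pdfdir <- system.file("pdfs", package = "llm4np")

# File names of the bundled PDFs (empty when the directory is not found).
bundled <- list.files(pdfdir, pattern = "\\.pdf$")
print(bundled)

# The source objects are internal, hence `:::`; print them only when the
# package is available.
if (requireNamespace("llm4np", quietly = TRUE)) {
  print(llm4np:::sqspsrc)
  print(llm4np:::pecssrc)
}
```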

Examples

deidpath = system.file("pdfs", "de-identified+report+1.pdf", package="llm4np")
deid = parse_nppdf(deidpath)
deid
#> Parsed NP pdf with 7 pages and 9 tables.