PDF Table Loader

This loader reads the tables contained in a PDF file.

You provide the PDF file and the pages to extract tables from, and the loader returns the tables found on those pages.

Usage

Here is an example of using PDFTableReader. The pages parameter has the same meaning as camelot's pages parameter, so you can use patterns such as all, 1,2,3, 10-20, and so on.

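from pathlib import Path

# This release is packaged as llama_index_readers_pdf_table,
# which installs under the llama_index.readers namespace.
from llama_index.readers.pdf_table import PDFTableReader

reader = PDFTableReader()
pdf_path = Path("/path/to/pdf")

# Extract the tables found on pages 80 through 90.
documents = reader.load_data(file=pdf_path, pages="80-90")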
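
To make the pages patterns mentioned above more concrete, here is a minimal sketch of how such a string could be expanded into page numbers. The helper `expand_pages` and its `total_pages` argument are hypothetical names introduced for illustration; this is not camelot's or the loader's actual implementation, only an approximation of the documented patterns (all, comma-separated pages, ranges, and camelot's `end` keyword).

```python
from typing import List


def expand_pages(spec: str, total_pages: int) -> List[int]:
    """Expand a camelot-style pages string into sorted 1-based page numbers.

    Hypothetical helper for illustration only; not part of camelot or
    PDFTableReader.
    """
    if spec.strip() == "all":
        return list(range(1, total_pages + 1))

    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            # A range such as "10-20" or "4-end" (inclusive on both sides).
            start_text, end_text = part.split("-", 1)
            start = int(start_text)
            end = total_pages if end_text.strip() == "end" else int(end_text)
            pages.update(range(start, end + 1))
        else:
            # A single page number such as "3".
            pages.add(int(part))

    # Drop page numbers that fall outside the document.
    return sorted(p for p in pages if 1 <= p <= total_pages)


print(expand_pages("1,2,3", 10))    # [1, 2, 3]
print(expand_pages("1,4-end", 6))   # [1, 4, 5, 6]
```

Under these assumptions, "80-90" in the example above would select the eleven pages from 80 to 90 inclusive, provided the PDF has at least 90 pages.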

Example

This loader is designed to load data into LlamaIndex. The loaded data can subsequently be used as a Tool in a LangChain Agent.

Release history

Version	Release date
0.1.3 (this version)	Feb 21, 2024
0.1.2	Feb 13, 2024
0.1.1	Feb 12, 2024
0.1.0	Feb 10, 2024
0.0.1	Feb 4, 2024

Download files

Source Distribution

llama_index_readers_pdf_table-0.1.3.tar.gz (2.6 kB), uploaded Feb 21, 2024 (Source)

Built Distribution

llama_index_readers_pdf_table-0.1.3-py3-none-any.whl (2.8 kB), uploaded Feb 21, 2024 (Python 3)

Hashes for llama_index_readers_pdf_table-0.1.3.tar.gz

Algorithm	Hash digest
SHA256	`902c98e74e12fef068f60fbb022995606052eba220d02662b7383376ff4e28de`
MD5	`ff71db5596d96305c44a5ddd6a9623f9`
BLAKE2b-256	`17e1ed6c1e9742146f7e79e5072ebd1e600d3825969e1d9f9351093e0bc90ea8`

Hashes for llama_index_readers_pdf_table-0.1.3-py3-none-any.whl

Algorithm	Hash digest
SHA256	`0c7a3cad35062158bc498d48635c37bdb77ba81198b951ab03034ce96c8095a3`
MD5	`7f0ba9960cf15f11cd5c47541a11b7ed`
BLAKE2b-256	`a3dc8335701d2e99d3877a55713c86a489922c63b457a5b5b40fd327fb2367d1`