Pdf Loader
PDFThumbnailReader
Bases: PDFReader
PDF parser with thumbnail for each page.
Source code in libs/kotaemon/kotaemon/loaders/pdf_loader.py
load_data
Parse file.
Source code in libs/kotaemon/kotaemon/loaders/pdf_loader.py
get_page_thumbnails
Get image thumbnails of the pages in the PDF file.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
file_path | Path | path to the PDF file | required |
page_number | list[int] | list of page numbers to extract | required |
Returns:
Type | Description |
---|---|
List[Image] | list of page thumbnails |
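The following is a minimal sketch of how a function with this signature might render thumbnails. It is not the code in `pdf_loader.py`. It assumes PyMuPDF (`fitz`) and Pillow are installed, that page numbers are zero-based, and that a `dpi` argument controls the render resolution. The names `check_page_numbers` and `sketch_page_thumbnails`, and the `dpi` default of 80, are hypothetical and chosen only for illustration.

```python
from pathlib import Path


def check_page_numbers(page_numbers: list[int], page_count: int) -> list[int]:
    """Return a copy of page_numbers if every index lies in [0, page_count)."""
    # Assumption: page numbers are zero-based indices into the document.
    bad = [n for n in page_numbers if not 0 <= n < page_count]
    if bad:
        raise IndexError(f"page numbers out of range: {bad}")
    return list(page_numbers)


def sketch_page_thumbnails(file_path: Path, page_numbers: list[int], dpi: int = 80):
    """Render the requested pages to PIL images (hypothetical stand-in)."""
    # Imported lazily so this module still loads without the optional
    # third-party dependencies installed.
    import fitz  # PyMuPDF
    from PIL import Image

    thumbnails = []
    with fitz.open(str(file_path)) as doc:
        for n in check_page_numbers(page_numbers, doc.page_count):
            # get_pixmap renders RGB without an alpha channel by default.
            pix = doc.load_page(n).get_pixmap(dpi=dpi)
            thumbnails.append(
                Image.frombytes("RGB", (pix.width, pix.height), pix.samples)
            )
    return thumbnails
```

Under these assumptions, a call such as `sketch_page_thumbnails(Path("report.pdf"), [0, 1])` would return two `PIL.Image.Image` objects, one per requested page. Validating the page numbers first makes an out-of-range request fail with a clear error before any rendering starts.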