Adobe
request_adobe_service ¶
Main function to call the adobe service, and unzip the results. Args: file_path (str): path to the pdf file output_path (str): path to store the results
Returns:
Name | Type | Description |
---|---|---|
output_path |
str
|
path to the results |
Source code in libs/kotaemon/kotaemon/loaders/utils/adobe.py
make_markdown_table ¶
Convert table from python list representation to markdown format. The input list consists of rows of tables, the first row is the header.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
table_as_list
|
List[List[str]]
|
list of table rows Example: [["Name", "Age", "Height"], ["Jake", 20, 5'10], ["Mary", 21, 5'7]] |
required |
Returns: markdown representation of the table
Source code in libs/kotaemon/kotaemon/loaders/utils/adobe.py
load_json ¶
load_excel ¶
Load excel file and convert to markdown
Source code in libs/kotaemon/kotaemon/loaders/utils/adobe.py
parse_table_paths ¶
Read the table stored in an excel file given the file path
Source code in libs/kotaemon/kotaemon/loaders/utils/adobe.py
parse_figure_paths ¶
Read and convert an image to base64 given the image path
Source code in libs/kotaemon/kotaemon/loaders/utils/adobe.py
generate_figure_captions ¶
Summarize several figures using GPT-4V. Args: vlm_endpoint (str): endpoint to the vision language model service figures (List): list of base64 images max_figures_to_process (int): the maximum number of figures will be summarized, the rest are ignored.
Returns:
Name | Type | Description |
---|---|---|
results |
List[str]
|
list of all figure captions and empty strings for |
List
|
ignored figures. |