FunctionsExtractorPdf to htmlUtil
Page Number Functions
ai-research-agent / extractor/pdf-to-html/util/page-number-functions
findFirstPage()
function findFirstPage(pageIndexNumMap: any): any;Defined in: src/extractor/pdf-to-html/util/page-number-functions.js:63
Checks when the page number first begins and returns it
Parameters
| Parameter | Type | Description |
|---|---|---|
|
| object |
Returns
any
For example: pageIndex: 10, pageNum: 3
findPageNumbers()
function findPageNumbers(
pageIndexNumMap: any,
pageIndex: number,
items: any[]): any;Defined in: src/extractor/pdf-to-html/util/page-number-functions.js:48
Searches both top and bottom area and returns an object
Parameters
| Parameter | Type | Description |
|---|---|---|
|
| object |
|
| index of the page |
|
| textContent.items |
Returns
any
pageIndexNumMap object
removePageNumber()
function removePageNumber(textContent: any, pageNum: number): any;Defined in: src/extractor/pdf-to-html/util/page-number-functions.js:104
Return textContent with items that have pageNum removed
Parameters
| Parameter | Type | Description |
|---|---|---|
|
| object |
|
|
Returns
any
filteredContent - textContent without items that have pageNum