Page Number Functions

findFirstPage()

function findFirstPage(pageIndexNumMap: any): any;

Defined in: src/extractor/pdf-to-html/util/page-number-functions.js:63

Determines where page numbering first begins and returns that position.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| pageIndexNumMap | any | object |

Returns

any

For example: { pageIndex: 10, pageNum: 3 }
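
The source does not show how findFirstPage() is implemented, so the following is only a sketch of one way such a lookup could work. It assumes pageIndexNumMap maps each page index to an array of numeric candidates found on that page. The name findFirstPageSketch and the runLength parameter are hypothetical. Numbering is treated as started at the first page whose candidate keeps incrementing by one over the following pages.

```javascript
// Hypothetical sketch, not the library's implementation.
// Assumed map shape: { [pageIndex]: number[] } of numeric candidates per page.
function findFirstPageSketch(pageIndexNumMap, runLength = 3) {
  const pageIndexes = Object.keys(pageIndexNumMap)
    .map(Number)
    .sort((a, b) => a - b);

  for (const pageIndex of pageIndexes) {
    for (const pageNum of pageIndexNumMap[pageIndex]) {
      // Accept a candidate only if the next pages continue the sequence.
      let consecutive = true;
      for (let offset = 1; offset < runLength; offset++) {
        const next = pageIndexNumMap[pageIndex + offset] || [];
        if (!next.includes(pageNum + offset)) {
          consecutive = false;
          break;
        }
      }
      if (consecutive) return { pageIndex, pageNum };
    }
  }
  return null; // no run of consecutive page numbers found
}

// Page 8 holds a stray number (a year); numbering starts on page index 10.
const sampleMap = { 8: [2019], 10: [3], 11: [4, 12], 12: [5] };
console.log(findFirstPageSketch(sampleMap)); // { pageIndex: 10, pageNum: 3 }
```

Requiring a short run of consecutive numbers is a design choice in this sketch: it keeps isolated numbers such as years or figure labels from being mistaken for the start of pagination.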


findPageNumbers()

function findPageNumbers(
   pageIndexNumMap: any, 
   pageIndex: number, 
   items: any[]): any;

Defined in: src/extractor/pdf-to-html/util/page-number-functions.js:48

Searches both the top and bottom areas of a page for page numbers and returns the pageIndexNumMap object.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| pageIndexNumMap | any | object |
| pageIndex | number | index of the page |
| items | any[] | textContent.items |

Returns

any

pageIndexNumMap object
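
As a rough illustration of searching the top and bottom areas, the sketch below treats the first and last few entries of items as those areas. That is an assumption; the source does not say how the areas are delimited. It also assumes each item carries its text in a str field, as pdf.js text items do. The function name findPageNumbersSketch and the edgeSize parameter are hypothetical.

```javascript
// Hypothetical sketch, not the library's implementation.
// Assumes items are pdf.js-style text items: { str: string, ... }.
function findPageNumbersSketch(pageIndexNumMap, pageIndex, items, edgeSize = 3) {
  const top = items.slice(0, edgeSize);   // assumed "top area"
  const bottom = items.slice(-edgeSize);  // assumed "bottom area"

  // A Set avoids duplicates when the two slices overlap on short pages.
  const candidates = new Set();
  for (const item of [...top, ...bottom]) {
    const text = item.str.trim();
    if (/^\d+$/.test(text)) candidates.add(Number(text));
  }

  if (candidates.size > 0) pageIndexNumMap[pageIndex] = [...candidates];
  return pageIndexNumMap;
}

const sampleItems = [
  { str: "Chapter 1" },
  { str: "Some body text" },
  { str: "more text 42" },
  { str: "even more" },
  { str: "3" },
];
console.log(findPageNumbersSketch({}, 10, sampleItems)); // { '10': [ 3 ] }
```

Only items that consist entirely of digits count as candidates here, so "more text 42" is ignored even though it contains a number.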


removePageNumber()

function removePageNumber(textContent: any, pageNum: number): any;

Defined in: src/extractor/pdf-to-html/util/page-number-functions.js:104

Returns textContent with the items that contain pageNum removed.

Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| textContent | any | object |
| pageNum | number | |

Returns

any

filteredContent - textContent without items that have pageNum
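
The filtering step can be sketched as below, under two assumptions that the source does not confirm: textContent has an items array of pdf.js-style text items with a str field, and an item "has" pageNum when its trimmed text equals the number exactly. The name removePageNumberSketch is hypothetical.

```javascript
// Hypothetical sketch, not the library's implementation.
// Assumes textContent looks like { items: [{ str: string, ... }, ...] }.
function removePageNumberSketch(textContent, pageNum) {
  const target = String(pageNum);
  // Keep every item whose text is not exactly the page number.
  const items = textContent.items.filter((item) => item.str.trim() !== target);
  // Return a copy so the caller's original textContent is left untouched.
  return { ...textContent, items };
}

const pageContent = {
  items: [{ str: "Introduction" }, { str: "See section 3 for details" }, { str: " 3 " }],
};
const filteredContent = removePageNumberSketch(pageContent, 3);
console.log(filteredContent.items.map((item) => item.str));
// [ 'Introduction', 'See section 3 for details' ]
```

Matching the whole trimmed string, instead of searching for the digit inside longer text, keeps body sentences that merely mention the same number.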