html-to-content

Documentation / extractor/html-to-content/html-to-content

ExtractedContent

Property	Type	Description	Defined in
`title`	`string`	The title of the content	packages/ai-research-agent/src/extractor/html-to-content/html-to-content.js:84
`author_cite`	`string`	The full citation for the author	packages/ai-research-agent/src/extractor/html-to-content/html-to-content.js:85
`author_short`	`string`	A shortened version of the author's name	packages/ai-research-agent/src/extractor/html-to-content/html-to-content.js:86
`author`	`string`	The author's name	packages/ai-research-agent/src/extractor/html-to-content/html-to-content.js:87
`date`	`string`	The publication date	packages/ai-research-agent/src/extractor/html-to-content/html-to-content.js:88
`source`	`string`	The source of the content	packages/ai-research-agent/src/extractor/html-to-content/html-to-content.js:89
`html`	`string`	The extracted main content in HTML format	packages/ai-research-agent/src/extractor/html-to-content/html-to-content.js:90

function extractContentAndCite(documentOrHTML: any, options: object): any;

Extracts the main content and citation information from a document or HTML string

Parameter	Type	Description
`documentOrHTML`	`any`	The document or HTML string to extract content from
`options`	{ `images`: `boolean`; `links`: `boolean`; `formatting`: `boolean`; `url`: `string`; `useExtractor2`: `boolean`; }	Optional configuration options
`options.images`	`boolean`	default=true - Whether to include images in the extracted content
`options.links`	`boolean`	default=true - Whether to include links in the extracted content
`options.formatting`	`boolean`	default=true - Whether to preserve formatting in the extracted content
`options.url`	`string`	The URL of the original document, if available, for absolutify-ing URLs
`options.useExtractor2`	`boolean`	default=false - false uses Mozilla Readability, true uses Postlight Mercury. then use the alternate if the first returns less than 200 characters

any

The extracted content and citation information