extract-cite

Extract

extractCite()

function extractCite(document, options): object

📚💎 Extract Expert Excerpt

Extracts the author, date, source, and title from HTML using meta tags and common class names. The author string is validated as a human name by checking it against a list of about 90k common first names, last names, and organizations. If it is a personal name, it is reversed to start with the author's last name (accounting for affixes and titles); organization names are not reversed. See: Article Extraction Benchmark
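The meta-tag part of this description can be illustrated with a minimal sketch. This is not the library's implementation: the helper names (`readMetaTags`, `firstOf`, `sketchExtractCite`), the regex-based scan, and the particular meta keys consulted are all assumptions chosen for illustration, and the class-name fallbacks mentioned above are left out.

```javascript
// Illustrative sketch only; not the code behind extractCite().
// Collects <meta name|property="..." content="..."> pairs from an HTML string.
function readMetaTags(html) {
  const meta = {};
  const tagPattern = /<meta\b[^>]*>/gi;
  const attrPattern = /([\w:-]+)\s*=\s*"([^"]*)"/g;
  for (const [tag] of html.matchAll(tagPattern)) {
    const attrs = {};
    for (const [, name, value] of tag.matchAll(attrPattern)) {
      attrs[name.toLowerCase()] = value;
    }
    const key = (attrs.name || attrs.property || "").toLowerCase();
    // Keep the first occurrence of each key.
    if (key && attrs.content !== undefined && !(key in meta)) {
      meta[key] = attrs.content;
    }
  }
  return meta;
}

// Hypothetical helper: returns the first non-empty value among the given keys.
function firstOf(meta, keys) {
  for (const key of keys) {
    if (meta[key]) return meta[key];
  }
  return "";
}

// The meta keys below are common conventions, assumed here for the example.
function sketchExtractCite(html) {
  const meta = readMetaTags(html);
  const titleMatch = html.match(/<title[^>]*>([^<]*)<\/title>/i);
  return {
    author: firstOf(meta, ["author", "article:author"]),
    date: firstOf(meta, ["article:published_time", "date"]),
    source: firstOf(meta, ["og:site_name"]),
    title: firstOf(meta, ["og:title"]) || (titleMatch ? titleMatch[1].trim() : ""),
  };
}

const sampleHtml = `
<html><head>
<title>Fallback Title</title>
<meta name="author" content="Jane Doe">
<meta property="article:published_time" content="2024-03-01">
<meta property="og:site_name" content="Example News">
<meta property="og:title" content="An Example Article">
</head><body></body></html>`;

const cite = sketchExtractCite(sampleHtml);
console.log(cite);
```

A regex scan keeps the sketch dependency-free; a real extractor would more likely query a parsed DOM, which also handles single-quoted attributes and other markup variations that this pattern ignores.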

Parameters

Parameter   Type       Description
document    Document   DOM object or HTML string with article content
options     {}

Returns

object

An object containing extracted citation information.

Name          Type
author        string
author_cite   string
author_short  string
date          string
source        string
title         string
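The description says personal names are reversed to lead with the last name while organizations are left alone. The sketch below shows one way such a rule could work. It is an assumption-heavy illustration, not the library's code: the tiny name sets stand in for the 90k-entry list, `formatAuthorForCitation` is a hypothetical name, and the "Last, First" output format is a guess at what a field like `author_cite` might hold.

```javascript
// Tiny stand-in lists (assumptions); the real list is described as ~90k entries.
const KNOWN_FIRST_NAMES = new Set(["jane", "john", "maria"]);
const KNOWN_ORGANIZATIONS = new Set(["reuters", "associated press"]);
const TITLES = new Set(["dr", "dr.", "prof", "prof.", "mr", "mr.", "ms", "ms.", "mrs", "mrs."]);
const SUFFIXES = new Set(["jr", "jr.", "sr", "sr.", "ii", "iii", "phd"]);
const SURNAME_PARTICLES = new Set(["van", "von", "de", "del", "der", "la", "bin"]);

// Hypothetical helper: returns "Last, First" for recognized personal names
// and the unchanged string for organizations or unrecognized names.
function formatAuthorForCitation(author) {
  const cleaned = author.trim().replace(/\s+/g, " ");
  if (KNOWN_ORGANIZATIONS.has(cleaned.toLowerCase())) return cleaned;

  let parts = cleaned.split(" ");

  // Drop leading titles such as "Dr." or "Prof.".
  while (parts.length > 1 && TITLES.has(parts[0].toLowerCase())) {
    parts = parts.slice(1);
  }

  // Set trailing suffixes such as "Jr." aside so they are not taken for the surname.
  const suffixes = [];
  while (parts.length > 1 && SUFFIXES.has(parts[parts.length - 1].toLowerCase())) {
    suffixes.unshift(parts.pop());
  }

  // Only reverse when the first token looks like a known first name.
  if (parts.length < 2 || !KNOWN_FIRST_NAMES.has(parts[0].toLowerCase())) {
    return cleaned;
  }

  // The surname is the last token, extended left over particles ("van der").
  let surnameStart = parts.length - 1;
  while (surnameStart > 1 && SURNAME_PARTICLES.has(parts[surnameStart - 1].toLowerCase())) {
    surnameStart--;
  }

  const surname = parts.slice(surnameStart).join(" ");
  const given = parts.slice(0, surnameStart).join(" ");
  const suffix = suffixes.length ? ", " + suffixes.join(" ") : "";
  return `${surname}, ${given}${suffix}`;
}

console.log(formatAuthorForCitation("Jane Doe"));
console.log(formatAuthorForCitation("Dr. John van der Berg Jr."));
console.log(formatAuthorForCitation("Associated Press"));
```

Leaving unrecognized strings untouched is a deliberately conservative choice in this sketch: a wrongly reversed organization name is harder to spot in a citation than a personal name left in its original order.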

Author

ai-research-agent (2024)