Skip to main content

extract-cite

Documentation / extractor/html-to-cite/extract-cite

Extract

extractCite()

function extractCite(document: Document, options: object): object;

Defined in: extractor/html-to-cite/extract-cite.js:25

📚💎 Extract Expert Excerpt

Extract author, date, source, and title from HTML using meta tags and common class names. Validates human name from author string to check against common list of 90k first names, last names,and organizations to infer if it should be reversed starting by author last name (accounting for affixes/titles), since organizations are not reversed. Article Extraction Benchmark

Parameters

ParameterTypeDescription

document

Document

dom object or html string with article content

options

{ }

Returns

object

An object containing extracted citation information.

NameTypeDefined in

author

string

extractor/html-to-cite/extract-cite.js:21

author_cite

string

extractor/html-to-cite/extract-cite.js:21

author_short

string

extractor/html-to-cite/extract-cite.js:21

date

string

extractor/html-to-cite/extract-cite.js:21

source

string

extractor/html-to-cite/extract-cite.js:21

title

string

extractor/html-to-cite/extract-cite.js:21

Author

ai-research-agent (2024)