Extracts the main content from a URL or HTML document. Based on Readability,
improved with 100+ custom adapters for major websites.
Strips the page down to basic HTML for reading mode or for saving research notes.
YouTube - retrieves the full transcript when the input is detected as a YouTube video.
PDF - extracts formatted text from a PDF, parsing headings, page headers, and
footnotes, and adding line breaks based on the standard deviation of text height over a range of text.
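The height-based line-break rule for PDFs is only described briefly, so the following is a minimal sketch of one way such a heuristic could work, not the library's actual implementation. The `TextRun` shape, the `joinWithLineBreaks` name, and the threshold (mean height plus one standard deviation) are all assumptions made for illustration.

```typescript
// Hypothetical shape of one line of text extracted from a PDF page.
interface TextRun {
  text: string;
  top: number;    // distance of the run's top edge from the top of the page
  height: number; // glyph height of the run
}

function mean(values: number[]): number {
  return values.reduce((sum, v) => sum + v, 0) / values.length;
}

// Population standard deviation.
function standardDeviation(values: number[]): number {
  const m = mean(values);
  return Math.sqrt(mean(values.map((v) => (v - m) ** 2)));
}

// Joins runs in reading order. A paragraph break is inserted when the
// whitespace between two consecutive runs exceeds an assumed threshold:
// mean text height plus one standard deviation of the heights.
function joinWithLineBreaks(runs: TextRun[]): string {
  if (runs.length === 0) return "";
  const heights = runs.map((r) => r.height);
  const threshold = mean(heights) + standardDeviation(heights);
  let out = runs[0].text;
  for (let i = 1; i < runs.length; i++) {
    const prev = runs[i - 1];
    const gap = runs[i].top - (prev.top + prev.height);
    out += (gap > threshold ? "\n\n" : " ") + runs[i].text;
  }
  return out;
}
```

Using the spread of heights rather than a fixed pixel value lets the threshold adapt to documents that mix body text with larger headings.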
Parameters
urlOrDoc: string | Document
A URL or a DOM Document object containing the article content.
Optional options: { images: boolean; links: boolean; formatting: boolean; absoluteURLs: boolean; timeout: number } = {}
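Because the options parameter defaults to `{}`, callers can presumably pass only the fields they care about. The sketch below shows one common way to merge a partial options object with defaults. The `ExtractOptions` and `resolveOptions` names, the default values, and the assumption that `timeout` is in milliseconds are all hypothetical and not taken from the library.

```typescript
// Mirrors the fields listed in the signature above.
interface ExtractOptions {
  images: boolean;
  links: boolean;
  formatting: boolean;
  absoluteURLs: boolean;
  timeout: number; // assumed here to be milliseconds
}

// Hypothetical defaults, chosen only for illustration.
const ASSUMED_DEFAULTS: ExtractOptions = {
  images: true,
  links: true,
  formatting: true,
  absoluteURLs: true,
  timeout: 5000,
};

// Fills in any omitted field from the assumed defaults.
function resolveOptions(options: Partial<ExtractOptions> = {}): ExtractOptions {
  return { ...ASSUMED_DEFAULTS, ...options };
}

// Usage: keep text and links, drop images, and allow a longer timeout.
const resolved = resolveOptions({ images: false, timeout: 10000 });
```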
🚜📜 Tractor the Text Extractor