Skip to main content

docx-to-content

Documentation / extractor/url-to-content/docx-to-content

Extract​

convertDOCXToHTML()​

function convertDOCXToHTML(input: string | ArrayBuffer | Blob | File, options?: DocxOptions): Promise<string>;

Defined in: extractor/url-to-content/docx-to-content.js:54

Converts a DOCX document to HTML

Parameters​

ParameterTypeDescription

input

string | ArrayBuffer | Blob | File

DOCX input to convert

options?

DocxOptions

Conversion options

Returns​

Promise<string>

The converted HTML

Throws​

If conversion fails

Example​

const html = await convertDOCXToHTML('https://example.com/doc.docx');
const html = await convertDOCXToHTML(fileInput.files[0]);

Other​

DocxOptions​

Defined in: extractor/url-to-content/docx-to-content.js:5

Properties​

PropertyTypeDescriptionDefined in

imgPath?

string

Base path for image resources

extractor/url-to-content/docx-to-content.js:8

includeStyles?

boolean

Whether to include document styles

extractor/url-to-content/docx-to-content.js:7

preserveShapes?

boolean

Whether to preserve shape elements

extractor/url-to-content/docx-to-content.js:6


ParagraphStyle​

Defined in: extractor/url-to-content/docx-to-content.js:220

Properties​

PropertyTypeDescriptionDefined in

alignment?

string

Text alignment (left, right, center, justify)

extractor/url-to-content/docx-to-content.js:221

indentation?

string

Paragraph indentation

extractor/url-to-content/docx-to-content.js:223

keepNext?

boolean

Keep with next paragraph

extractor/url-to-content/docx-to-content.js:224

pageBreakBefore?

boolean

Force page break before

extractor/url-to-content/docx-to-content.js:225

spacing?

string

Line spacing

extractor/url-to-content/docx-to-content.js:222


RunStyle​

Defined in: extractor/url-to-content/docx-to-content.js:229

Properties​

PropertyTypeDescriptionDefined in

bold?

boolean

Bold text

extractor/url-to-content/docx-to-content.js:230

color?

string

Text color

extractor/url-to-content/docx-to-content.js:233

font?

string

Font family

extractor/url-to-content/docx-to-content.js:236

highlight?

string

Highlight color

extractor/url-to-content/docx-to-content.js:234

italic?

boolean

Italic text

extractor/url-to-content/docx-to-content.js:231

size?

string

Font size

extractor/url-to-content/docx-to-content.js:235

underline?

boolean

Underlined text

extractor/url-to-content/docx-to-content.js:232


StyleConfig​

Defined in: extractor/url-to-content/docx-to-content.js:13

Properties​

PropertyTypeDescriptionDefined in

block

boolean

If true, element is rendered as block

extractor/url-to-content/docx-to-content.js:14

class?

string

CSS class name

extractor/url-to-content/docx-to-content.js:18

element

string

HTML element name

extractor/url-to-content/docx-to-content.js:16

heading?

boolean

If true, element is a heading

extractor/url-to-content/docx-to-content.js:15

xmlName?

string

DOCX XML element name

extractor/url-to-content/docx-to-content.js:17