Extract items from 10-K as HTML Snippet with Formatting Intact

I'm working with the ten_k.parse() function to extract the risk factor section from 10-K filings. However, for my analysis, I need to preserve the original HTML formatting, particularly bold and italic tags, so I can accurately identify and count individual risk factors (e.g., those introduced with formatted subheadings).

Would it be possible to support an additional output format like format="html" in get_section, which returns the section as a raw HTML snippet with the tags intact? Alternatively, is there a recommended way to recover the exact HTML corresponding to a parsed section?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Extract items from 10-K as HTML Snippet with Formatting Intact #57

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Extract items from 10-K as HTML Snippet with Formatting Intact #57

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions