Parse HTML

Currently, the Markdown package makes no effort to parse HTML content, contrary to markdown and YAML:

> > How is one supposed to define a renderer for this? The `inlineHtmlTag` only gets one argument, the html tag, and I see no way to get at the content between both tags to put braces around the argument?
> 
> With difficulty. You would need to scan ahead hoping for a closing HTML tag. There is no attempt to provide comprehensive support for rendering HTML elements at this moment.
> 
> We might add an option to enable a new parsing regime for HTML that would produce more useful renderers for all aspects of HTML code, similarly to YAML. However, there's currently no detailed proposal (see e.g. https://github.com/Witiko/markdown/discussions/517) for this feature, which would be required to start the implementation.
> 
> Since our parser already [differentiates between different types of HTML content](https://github.com/Witiko/markdown/blob/746cfc56b715f46e656746a620579348b322b36e/markdown.dtx#L32848-L32854) following [CommonMark's model of HTML](https://spec.commonmark.org/0.31.2/#html-blocks), we could start by exposing the corresponding PEG parsers as individual renderers. While it's unclear whether this would be sufficient for rendering HTML in TeX, it would be a start and definitely much less work than including a full-blown HTML parser in addition to the current CommonMark parser.

 _Originally posted by @u-fischer and @Witiko in [#597](https://github.com/Witiko/markdown/issues/597#issuecomment-3554531819)_

### Tasks
- [ ] Propose and implement renderers that correspond to [CommonMark's model of HTML](https://spec.commonmark.org/0.31.2/#html-blocks).
    Produce these renderers if a corresponding option has been enabled.
- [ ] Propose and implement renderers that correspond to HTML nodes.
    Produce these renderers if a corresponding option has been enabled.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parse HTML #606

Tasks

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Parse HTML #606

Description

Tasks

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions