Page MenuHomePhabricator

RemexHtmlComponent
ActivePublic

Members (5)

Watchers (2)

Details

Description

For issues relating to RemexHtml, an HTML 5 parser library for PHP.

See https://www.mediawiki.org/wiki/RemexHtml

Recent Activity

Thu, May 28

jnuche moved T426652: DOMException: Invalid Character Error from Untriaged to Apr–Jun 2026 on the Wikimedia-production-error board.
Thu, May 28, 3:38 PM · Content-Transform-Team (Work In Progress), RemexHtml, Wikimedia-production-error
MSantos moved T426652: DOMException: Invalid Character Error from Backlog to Needs Investigation on the Content-Transform-Team (Work In Progress) board.
Thu, May 28, 2:12 PM · Content-Transform-Team (Work In Progress), RemexHtml, Wikimedia-production-error
MSantos moved T426652: DOMException: Invalid Character Error from Backlog to Work In Progress on the Content-Transform-Team board.
Thu, May 28, 2:12 PM · Content-Transform-Team (Work In Progress), RemexHtml, Wikimedia-production-error

Mon, May 18

ssastry renamed T426652: DOMException: Invalid Character Error from Exception executing job: parsoidCachePrewarm User:Emdosis/sandbox revId=1246490609 pageId=76732332 page_touched=20260423193125 namespace=2 title=Emdosis/sandbox causeAction=view options=0 rootJobIsSelf=1 rootJobSignature=68e733c09e260fd51dabf1c0d3f90dfc0a to DOMException: Invalid Character Error.
Mon, May 18, 5:17 PM · Content-Transform-Team (Work In Progress), RemexHtml, Wikimedia-production-error
Aklapper added a project to T426652: DOMException: Invalid Character Error: RemexHtml.
Mon, May 18, 5:06 PM · Content-Transform-Team (Work In Progress), RemexHtml, Wikimedia-production-error

May 9 2026

Reedy removed a project from T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document: Patch-For-Review.
May 9 2026, 4:31 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml
A_smart_kitten added a project to T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document: MediaWiki-libs-Zest.
May 9 2026, 3:07 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml

Apr 7 2026

valerio.bozzolan closed T421529: Crash in DOMImplementation with php-dom installed: "ValueError: class_alias(): Argument #1 ($class) must be a user-defined class name, internal class name given" using MediaWiki 1.45.1 and PHP 8.2 / 8.3, a subtask of T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document, as Invalid.
Apr 7 2026, 9:59 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml

Mar 31 2026

Maintenance_bot removed a project from T389712: Make MMLbase support trees: Patch-For-Review.
Mar 31 2026, 10:33 AM · MW-1.45-notes (1.45.0-wmf.13; 2025-08-05), MW-1.44-notes (1.44.0-wmf.28; 2025-05-06), RemexHtml, Math
gerritbot added a comment to T389712: Make MMLbase support trees.

Change #1137286 abandoned by Physikerwelt:

[mediawiki/extensions/Math@master] Make MMLbase support trees: Improve Visitor implementation

https://gerrit.wikimedia.org/r/1137286

Mar 31 2026, 9:59 AM · MW-1.45-notes (1.45.0-wmf.13; 2025-08-05), MW-1.44-notes (1.44.0-wmf.28; 2025-05-06), RemexHtml, Math
gerritbot added a comment to T388985: Add support for more complex linking in <math/> formulas.

Change #1128019 abandoned by Physikerwelt:

[mediawiki/extensions/Math@master] Add support for more complex linking in <math/> formulas

https://gerrit.wikimedia.org/r/1128019

Mar 31 2026, 9:59 AM · RemexHtml, Patch-For-Review, Math

Mar 29 2026

Physikerwelt updated the task description for T420668: RemexHTML should not add p for math elements within blockqoute.
Mar 29 2026, 3:10 PM · RemexHtml

Mar 27 2026

valerio.bozzolan added a subtask for T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document: T421529: Crash in DOMImplementation with php-dom installed: "ValueError: class_alias(): Argument #1 ($class) must be a user-defined class name, internal class name given" using MediaWiki 1.45.1 and PHP 8.2 / 8.3.
Mar 27 2026, 6:09 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml

Mar 26 2026

Izno merged T421281: Gallery tag in bulleted list item fails to close bulleted list tag and generates empty gallery container into T11996: Multiline HTML elements in lists parsed to broken HTML.
Mar 26 2026, 1:24 AM · Parsoid, RemexHtml, Tidy, MediaWiki-Parser

Mar 19 2026

Physikerwelt updated the task description for T420668: RemexHTML should not add p for math elements within blockqoute.
Mar 19 2026, 10:48 PM · RemexHtml
Physikerwelt created T420668: RemexHTML should not add p for math elements within blockqoute.
Mar 19 2026, 10:48 PM · RemexHtml

Feb 5 2026

Aklapper added a comment to T342304: Request timeout in LuaError::getScriptTraceHtml() leads to shared memory corruption, segfault flood.

@tstarling: This task has been assigned to you a while ago. Is this still a valid (open) task which requires more work, or do you assume this got fixed by T389734?

Feb 5 2026, 10:30 AM · MediaWiki-Platform-Team (Radar), php-segfault, Excimer, RemexHtml, Wikimedia-production-error

Jan 30 2026

gerritbot added a comment to T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document.

Change #1233235 abandoned by C. Scott Ananian:

[mediawiki/services/parsoid@REL1_43] Support Dodo/PHP8.4 DOM implementations

Reason:

I don't think this is a good idea to backport.

https://gerrit.wikimedia.org/r/1233235

Jan 30 2026, 10:59 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml

Jan 29 2026

gerritbot added a comment to T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document.

Change #1233229 merged by jenkins-bot:

[mediawiki/services/parsoid@REL1_44] Support Dodo/PHP8.4 DOM implementations

https://gerrit.wikimedia.org/r/1233229

Jan 29 2026, 10:57 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml

Jan 26 2026

gerritbot added a comment to T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document.

Change #1233235 had a related patch set uploaded (by Reedy; author: C. Scott Ananian):

[mediawiki/services/parsoid@REL1_43] Support Dodo/PHP8.4 DOM implementations

https://gerrit.wikimedia.org/r/1233235

Jan 26 2026, 6:21 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml
gerritbot added a project to T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document: Patch-For-Review.
Jan 26 2026, 6:00 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml
gerritbot added a comment to T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document.

Change #1233229 had a related patch set uploaded (by Reedy; author: C. Scott Ananian):

[mediawiki/services/parsoid@REL1_44] Support Dodo/PHP8.4 DOM implementations

https://gerrit.wikimedia.org/r/1233229

Jan 26 2026, 6:00 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml

Jan 20 2026

gerritbot added a comment to T414289: Use mrow elements for linking instead of anchor tags.

Change #1225176 merged by jenkins-bot:

[mediawiki/extensions/Math@master] Use mrow elements for linking instead of anchor tags

https://gerrit.wikimedia.org/r/1225176

Jan 20 2026, 4:20 PM · Patch-For-Review, RemexHtml, Math

Jan 19 2026

Physikerwelt added a comment to T414289: Use mrow elements for linking instead of anchor tags.

@FrederikHennecke1 that means Q4 26, With 153

Jan 19 2026, 9:30 AM · Patch-For-Review, RemexHtml, Math

Jan 18 2026

FrederikHennecke1 added a comment to T414289: Use mrow elements for linking instead of anchor tags.

Firefox 140.7.0esr (64-bit) on Debian 13.

Jan 18 2026, 1:41 PM · Patch-For-Review, RemexHtml, Math
FrederikHennecke1 added a comment to T414289: Use mrow elements for linking instead of anchor tags.

Firefox 140.7.0esr (64-bit) on Debian 13.

Jan 18 2026, 12:56 PM · Patch-For-Review, RemexHtml, Math
Physikerwelt added a comment to T414289: Use mrow elements for linking instead of anchor tags.

Interesting, what Firefox version are you using?

Jan 18 2026, 11:48 AM · Patch-For-Review, RemexHtml, Math

Jan 17 2026

FrederikHennecke1 added a comment to T414289: Use mrow elements for linking instead of anchor tags.

In the MathWG we defined that it should be possible to replace all mrow with a and stop outputting mrow at all see
https://codepen.io/physikerwelt/pen/XJKMMyo

Jan 17 2026, 4:53 PM · Patch-For-Review, RemexHtml, Math
Physikerwelt added a comment to T414289: Use mrow elements for linking instead of anchor tags.

In the MathWG we defined that it should be possible to replace all mrow with a and stop outputting mrow at all see
https://codepen.io/physikerwelt/pen/XJKMMyo

Jan 17 2026, 11:36 AM · Patch-For-Review, RemexHtml, Math
FrederikHennecke1 added a comment to T414289: Use mrow elements for linking instead of anchor tags.
<math xmlns="http://www.w3.org/1998/Math/MathML" class="mwe-math-element mwe-math-element-inline">
    <mrow data-mjx-texclass="ORD">
        <mstyle displaystyle="true" scriptlevel="0"><a href="https://en.wikipedia.org/wiki/Energy" title="energy">
                <mi>E</mi>
            </a>
            <mo stretchy="false">=</mo><a href="https://en.wikipedia.org/wiki/Mass" title="mass">
                <mi>m</mi>
            </a>
            <msup><a href="https://en.wikipedia.org/wiki/Speed_of_light" title="speed of light in vacuum">
                    <mi>c</mi>
                </a>
                <mrow data-mjx-texclass="ORD">
                    <mn>2</mn>
                </mrow>
            </msup>
        </mstyle>
    </mrow>
</math>
<br>
<math xmlns="http://www.w3.org/1998/Math/MathML" class="mwe-math-element mwe-math-element-inline">
    <mrow data-mjx-texclass="ORD">
        <mstyle displaystyle="true" scriptlevel="0">
                <mi>E</mi>
            <mo stretchy="false">=</mo>
                <mi>m</mi>
            <msup>
                    <mi>c</mi>
                <mrow data-mjx-texclass="ORD">
                    <mn>2</mn>
                </mrow>
            </msup>
        </mstyle>
    </mrow>
</math>

This is the rendering in Firefox. Chrome and Webkit look normal. But this issue is a bit wonky. I did screenshot it in my default Firefox browser(140.7.0esr ). This issue did not show up when I tried to compare it with BackstopJS so it may just be my local font.

Jan 17 2026, 1:20 AM · Patch-For-Review, RemexHtml, Math

Jan 12 2026

Physikerwelt added a comment to T414289: Use mrow elements for linking instead of anchor tags.

@FrederikHennecke1 thank you! I just left a note to https://github.com/mathjax/MathJax/issues/3484#issuecomment-3740249528 as I wasn't expecting this and want to avoid surprises in the future.

Jan 12 2026, 7:55 PM · Patch-For-Review, RemexHtml, Math
FrederikHennecke1 added a comment to T414289: Use mrow elements for linking instead of anchor tags.

@FrederikHennecke1 do you think it is a bug in MathJax that the title is not used? If yes, we should file an issue here https://github.com/mathjax/mathjax/issues

Jan 12 2026, 6:45 PM · Patch-For-Review, RemexHtml, Math
Physikerwelt added a comment to T414289: Use mrow elements for linking instead of anchor tags.

@FrederikHennecke1 do you think it is a bug in MathJax that the title is not used? If yes, we should file an issue here https://github.com/mathjax/mathjax/issues

Jan 12 2026, 12:16 PM · Patch-For-Review, RemexHtml, Math

Jan 11 2026

gerritbot added a project to T414289: Use mrow elements for linking instead of anchor tags: Patch-For-Review.
Jan 11 2026, 9:49 PM · Patch-For-Review, RemexHtml, Math
gerritbot added a comment to T414289: Use mrow elements for linking instead of anchor tags.

Change #1225176 had a related patch set uploaded (by FrederikHennecke1; author: FrederikHennecke1):

[mediawiki/extensions/Math@master] Use mrow elements for linking instead of anchor tags

https://gerrit.wikimedia.org/r/1225176

Jan 11 2026, 9:49 PM · Patch-For-Review, RemexHtml, Math
FrederikHennecke1 created T414289: Use mrow elements for linking instead of anchor tags.
Jan 11 2026, 9:45 PM · Patch-For-Review, RemexHtml, Math

Jan 9 2026

PrimeHunter created T414152: In HTML table, added paragraph code for a cell may exclude the last source line.
Jan 9 2026, 1:39 AM · MediaWiki-Parser

Dec 28 2025

Physikerwelt closed T389461: wikitexvc should create a tree instead of a string, a subtask of T388985: Add support for more complex linking in <math/> formulas, as Resolved.
Dec 28 2025, 8:39 PM · RemexHtml, Patch-For-Review, Math
Physikerwelt closed T389461: wikitexvc should create a tree instead of a string as Resolved.

I think this is complete.

Dec 28 2025, 8:39 PM · RemexHtml, Math
Physikerwelt added a comment to T388985: Add support for more complex linking in <math/> formulas.

@FrederikHennecke1 the W3C mathwg discussion continued. Eventually, only mrow/a elements should have links. I suggest we add mrows when needed, and don't switch to a elements right now. As soon as Chrome has native support for links, we rename all mrow elements to a elements (which is a very minimal change with a lot of effect).

Dec 28 2025, 8:29 PM · RemexHtml, Patch-For-Review, Math

Dec 27 2025

FrederikHennecke1 added a comment to T388985: Add support for more complex linking in <math/> formulas.

I just tried to find all formulas in enwiki with QIDs with a short python script but I could only find 161 unique formulas. Is this correct? If that is the case I think that we don't necessarily have to rewrite the cache to cache these formulas. I also tested the performance of with / without caching and got the following results:

Dec 27 2025, 8:29 PM · RemexHtml, Patch-For-Review, Math

Dec 19 2025

Reedy removed a project from T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document: Patch-For-Review.
Dec 19 2025, 5:12 PM · MediaWiki-libs-Zest, Parsoid-Read-Views (Performance), Parsoid, RemexHtml

Nov 13 2025

thcipriani moved T402444: DOMException: Invalid Character Error from Oct-Dec 2025 to Jul–Sep 2025 on the Wikimedia-production-error board.
Nov 13 2025, 4:40 PM · RemexHtml, Wikimedia-production-error
thcipriani moved T402444: DOMException: Invalid Character Error from Untriaged to Oct-Dec 2025 on the Wikimedia-production-error board.
Nov 13 2025, 4:28 PM · RemexHtml, Wikimedia-production-error

Nov 12 2025

Physikerwelt added a comment to T388985: Add support for more complex linking in <math/> formulas.

@FrederikHennecke1 It's still significantly faster with caching. So we need to continue with caching.

Nov 12 2025, 7:06 AM · RemexHtml, Patch-For-Review, Math

Oct 11 2025

Bugreporter2 updated the task description for T388985: Add support for more complex linking in <math/> formulas.
Oct 11 2025, 1:40 AM · RemexHtml, Patch-For-Review, Math
Bugreporter2 updated the task description for T388985: Add support for more complex linking in <math/> formulas.
Oct 11 2025, 1:39 AM · RemexHtml, Patch-For-Review, Math
FrederikHennecke1 added a comment to T388985: Add support for more complex linking in <math/> formulas.

Thank you. That seems to better match my expectations. I'll try to run your github on my machine to see if I can match those values. In https://ceur-ws.org/Vol-1785/W48.pdf we show the rendering time over the input length (Fig. 3). I find that a bit more intuitive (but more complicated) than the median.

Oct 11 2025, 1:21 AM · RemexHtml, Patch-For-Review, Math

Oct 10 2025

Physikerwelt added a comment to T389712: Make MMLbase support trees.

It was not needed up to now I think. Maybe there would be a performance gain if we used a read onlyclass https://en.wikipedia.org/wiki/Immutable_object?wprov=sfti1#PHP so you can introduce it if needed but it is a bit nicer to avoid having that

Oct 10 2025, 6:48 PM · MW-1.45-notes (1.45.0-wmf.13; 2025-08-05), MW-1.44-notes (1.44.0-wmf.28; 2025-05-06), RemexHtml, Math
JeanCASPAR added a comment to T389712: Make MMLbase support trees.

Is there a rationale for not having a MMLleaf::setText method, or is it an oversight ? This forbid post-processing of the produced tree, for example.

Oct 10 2025, 5:02 PM · MW-1.45-notes (1.45.0-wmf.13; 2025-08-05), MW-1.44-notes (1.44.0-wmf.28; 2025-05-06), RemexHtml, Math