Closed Bug 922018 Opened 11 years ago Closed 10 years ago

Peacekeeper domDynamicCreationInnerHTML slower than in Chrome

Tracking

()

Status:

RESOLVED WORKSFORME

People

(Reporter: jandem, Unassigned)

References

(Depends on 1 open bug, Blocks 1 open bug)

Details

Attachments

(2 files, 2 obsolete files)

Testcase 11 years ago Jan de Mooij [:jandem] 952 bytes, text/html		Details
WIP cache 11 years ago Olli Pettay [:smaug][bugs@pettay.fi] 6.84 KB, patch		Details \| Diff \| Splinter Review
innerHTML cache 11 years ago Olli Pettay [:smaug][bugs@pettay.fi] 8.71 KB, patch		Details \| Diff \| Splinter Review
v3 11 years ago Olli Pettay [:smaug][bugs@pettay.fi] 8.71 KB, patch		Details \| Diff \| Splinter Review

Jan de Mooij [:jandem]

Reporter

Description

•

11 years ago

Attached file Testcase — Details

All this test does is:

        this.hiddenPlayground.innerHTML = ...;

Numbers:

Chrome 31:    1280 ms
Safari 6.0.5: 1292 ms
Nightly 27:   1764 ms

bz: sorry for flooding you with needinfo? requests, but I've filed bugs on most of the Peacekeeper tests now except some of the DOM/jQuery ones.

Flags: needinfo?(bzbarsky)

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 1

•

11 years ago

Hmm, we used to be faster than others with innerHTML, at some point.
But I guess that depends on what is being parsed.

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 2

•

11 years ago

I don't see anything special in the profile. Just many things taking a bit too much time.
*Element::SetAttr and stuff under it and also 
HTML parser seems to be quite malloc-happy: 
nsHTML5HTMLAttributes dtor takes significant time and nsHtml5TreeBuilder::appendToCurrentNodeAndPushElementMayFoster tends to malloc quite a bit.
Some object recycling might make sense.

Boris Zbarsky [:bzbarsky]

Comment 3

•

11 years ago

Yeah... at least on Mac, free is 20% of the time here.  malloc is 15%; realloc is 4%.

Olli, how insane would it be to have an innerHTML cache mapping strings to DOM fragments that we could just clone?  I wonder how often it would get hit in non-benchmark situations...

Flags: needinfo?(bzbarsky) → needinfo?(bugs)

Boris Zbarsky [:bzbarsky]

Updated

•

11 years ago

Flags: needinfo?(hsivonen)

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 4

•

11 years ago

Henri, I'll need some help figuring out how to optimize the parser, given that the
java-c++ translation isn't quite working atm.

Flags: needinfo?(bugs)

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 5

•

11 years ago

Caching DOM fragments...  I wonder how to make the hashtable look up fast, if strings are
very similar.
Perhaps just cache 10 fragments or something?
I'll try that, but the parser needs still some optimizing.

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 6

•

11 years ago

A super simple cache gives 50% speed up here, but it is not clear to me whether it is useful
outside benchmarks. It might be. Trying to find some innerHTML heavy code.

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 7

•

11 years ago

Attached patch WIP cache (obsolete) — Details — Splinter Review

a simple cache. not sure we want this kind of thing, but maybe it wouldn't 
cause harm.
https://tbpl.mozilla.org/?tree=Try&rev=1ef2088d9c9f

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 8

•

11 years ago

Attached patch innerHTML cache (obsolete) — Details — Splinter Review

https://tbpl.mozilla.org/?tree=Try&rev=4c9bd8fb7db6

I was thinking to move this to parser level, but that becomes tricky because
of different notifying based on caching or not caching.

Attachment #815178 - Attachment is obsolete: true

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 9

•

11 years ago

Attached patch v3 — Details — Splinter Review

https://tbpl.mozilla.org/?tree=Try&rev=3e162e5f5460

Attachment #815500 - Attachment is obsolete: true

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 10

•

11 years ago

Hmm, what caching is hard, since things like img and video start loading and may trigger events.

Jan de Mooij [:jandem]

Reporter

Comment 11

•

11 years ago

(In reply to Olli Pettay [:smaug] from comment #10)
> Hmm, what caching is hard, since things like img and video start loading and
> may trigger events.

It's not possible to detect these cases? Else we should work on optimizing the HTML parser..

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 12

•

11 years ago

We should definitely optimize HTML parser in any case.
I need to still figure out a sane way to do that since the current
Java->C++ translation makes coding harder than usually.

Henri Sivonen (:hsivonen)

Comment 13

•

11 years ago

(In reply to Olli Pettay [:smaug] from comment #12)
> We should definitely optimize HTML parser in any case.
> I need to still figure out a sane way to do that since the current
> Java->C++ translation makes coding harder than usually.

If you mean recycling the attribute holders, what makes it hard is the reuse of the off-the-main-thread code on the main thread--not the Java stuff. The Java stuff *already* has the capability to reuse attribute holders.

In general, we could call malloc a lot less for innerHTML if we had a distinct main-thread-oriented HTML parser similar to the one initially landed before Firefox 3.6.

The Java code supports pluggable tree builders. However, to avoid virtual calls, that abstraction is flattened away in C++. If we want to plug in a tree building back end that doesn't generate tree ops but actually builds the tree right away, we could either:

 * Use the translator infrastructure to generate a second copy of the parser and avoid virtual calls.

 * Introduce virtual calls to the C++ side and have two subclasses of the tree builder.

Flags: needinfo?(hsivonen)

Henri Sivonen (:hsivonen)

Comment 14

•

11 years ago

That is to say, I think adding some sort of attribute holder reuse to the off-the-main-thread architecture would be a lot less of a fix than getting rid of the entire off-the-main-thread architecture for innerHTML and DOMParser. It wouldn't even be particularly hard.

What we need to decide is whether we want to avoid adding a bunch of frequent virtual calls or whether we want to avoid the code size cost of having a second copy of parser core.

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 15

•

11 years ago

(In reply to Henri Sivonen (:hsivonen) from comment #13)
> If you mean recycling the attribute holders, what makes it hard is the reuse
> of the off-the-main-thread code on the main thread--not the Java stuff. The
> Java stuff *already* has the capability to reuse attribute holders.

I mean doing low level optimizations in general, so that we can for example reduce use of malloc/free everywhere.

Henri Sivonen (:hsivonen)

Comment 16

•

11 years ago

(In reply to Olli Pettay [:smaug] from comment #15)
> I mean doing low level optimizations in general, so that we can for example
> reduce use of malloc/free everywhere.

But even before we get to that level of optimization, we should first get the off-the-main-thread machinery out of the way. It's the main cause of malloc/free in the parser. If we get rid of a lot of malloc/free, low-level optimization might not make sense.

Olli Pettay [:smaug][bugs@pettay.fi]

Updated

•

11 years ago

Depends on: 934565

Henri Sivonen (:hsivonen)

Updated

•

10 years ago

Depends on: 959150

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 17

•

10 years ago

On linux I get ~1450 both in Nightly and Chromium-32.

Emanuel Hoogeveen [:ehoogeveen]

Comment 18

•

10 years ago

On Windows I see the following. The variance figures are just ballpark estimates.

Chrome 32.0.1700.102:   1043 ms +/- 10 ms
Chrome 33.0.1750.46:    1004 ms +/-  5 ms
Chrome 34.0.1809.0:      962 ms +/-  5 ms
Firefox 29.0 2014-01-29: 882 ms +/- 20 ms

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 19

•

10 years ago

I'd say this is fixed, but will get better once bug 959150 lands.

Status: NEW → RESOLVED

Closed: 10 years ago

Resolution: --- → FIXED

Ryan VanderMeulen [:RyanVM]

Updated

•

10 years ago

Resolution: FIXED → WORKSFORME

You need to log in before you can comment on or make changes to this bug.