This table will serve as a content storage layer for rendered HTML of abstract articles in particular languages, for cross-wiki reads embedded within Wikipedias (WE 2.3.8). See T422619 and T422620 (and the rest of the task tree) for more context.
- Should this table be replicated to wiki replicas (does it not contain private data)?
- Yes.
- Will you be doing cross-joins with the wiki metadata?
- No.
- Size of the table (number of rows expected).
- ~12k by end of 2026; < 1 M by end of 2027, by which time this approach will be reviewed and maybe replaced. Note that each row has one field with a potentially very heavy blob.
- Expected growth per year (number of rows).
- A function of the number of languages and articles allowed into the system. Controlled to some extent by the team's roll-out cadence (and popularity).
- Expected amount of queries, both writes and reads (per minute, per hour...per day, any of those are ok).
- By end-2026, ~1 write per minute & ~1 read per second.
- By end-2027, ~2 writes per second & ~10 reads per second.
- Examples of queries that will be using the table.
- https://gerrit.wikimedia.org/g/mediawiki/extensions/WikiLambda/+/refs/heads/master/includes/AWStorage/DBAWArticleStore.php
- Only very simple SELECTs/REPLACEs.
- The release plan for the feature (are there specific wikis you'd like to test first etc).
- First to abstract.wikipedia.org and test.wikipedia.org in Q4; then to 1–2 other wikis in Q1, and a handful more in Q2 if it goes well.
