chore(l10n): extract strings, add docs, load locales on demand by LeoMcA · Pull Request #452 · mdn/fred

LeoMcA · 2025-07-24T18:29:09Z

Gets things to a stage where all strings are - in theory - l10n-able. Next steps will be improving the l10n experience, and further optimising which strings we ship to the client.

The added README is a good source for some of what's going on here.

github-actions · 2025-07-24T18:31:28Z

ba08f2e was deployed to: https://fred-pr452.review.mdn.allizom.net/

caugner · 2025-08-19T11:36:04Z

Converted to draft, as it has merge conflicts.

bcolsson

I understand that more localization work is planned for some point in the future, but I noticed some major issues that could cause headaches down the road, so I'm adding comments. I'd suggest making changes before asking localizers to translate these. (If the expectation is to use Pontoon, then the complicated Fluent logic wouldn't be supported well either.)

bcolsson · 2025-09-03T00:04:17Z

l10n/template.ftl

+footer-copyright = Portions of this content are ©1998–{ $year } by individual mozilla.org contributors. Content available under <a data-l10n-name="cc">a Creative Commons license</a>.
+search-modal-site-search = Site search for <em>{ $query }</em>
+site-search-search-stats = Found { $results } documents.
+site-search-suggestion-matches =


While this is technically valid Fluent, it's best to avoid this pattern. This should be two strings; otherwise you can't guarantee that all patterns will have translations (reference).

Typically you want to remove the logic from the strings themselves, so it's preferable to have a -greaterthan and an -equals string, and to have the logic that chooses the correct string happen outside of Fluent.
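
A minimal sketch of what choosing the string outside of Fluent could look like on the JavaScript side. The two message ids and the `relation` values are hypothetical, since the original message body is not shown in this excerpt; the point is only that each Fluent message becomes one complete sentence with no selector.

```javascript
// Hypothetical ids: one plain message per case, so every locale must translate both.
const SUGGESTION_MATCH_IDS = new Map([
  ["equals", "site-search-suggestion-matches-equals"],
  ["greaterthan", "site-search-suggestion-matches-greaterthan"],
]);

/**
 * Picks the message id in code instead of inside a Fluent select expression.
 * @param {string} relation assumed input ("equals" or "greaterthan"), not taken from the PR
 * @returns {string} the id to pass to the l10n lookup
 */
function suggestionMatchesId(relation) {
  const id = SUGGESTION_MATCH_IDS.get(relation);
  if (id === undefined) {
    throw new Error(`Unknown relation: ${relation}`);
  }
  return id;
}

console.log(suggestionMatchesId("greaterthan"));
```

Throwing on an unknown value is a design choice for the sketch: a missing case then fails loudly during development rather than silently falling back to a default variant.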

bcolsson · 2025-09-03T00:12:56Z

l10n/template.ftl

+compat-support-flags =
+    { NUMBER($has_added) ->
+        [one] From version { $version_added }
+       *[other] { "" }
+    }{ $has_last ->
+        [one]
+            { NUMBER($has_added) ->
+               *[zero] Until { $versionLast } users
+                [one] { " " }until { $versionLast } users
+            }
+       *[zero]
+            { NUMBER($has_added) ->
+               *[zero] Users
+                [one] { " " }users
+            }
+    }
+    { " " }must explicitly set the <code data-l10n-name="name">{ $flag_name }</code>{ " " }
+    { $flag_type ->
+       *[preference] preference
+        [runtime_flag] runtime flag
+    }{ NUMBER($has_value) ->
+        [one] { " " }to <code data-l10n-name="value">{ $flag_value }</code>
+       *[other] { "" }
+    }{ "." }
+    { $flag_type ->
+        [preference] To change preferences in { $browser_name }, visit { $browser_pref_url }.
+       *[other] { "" }
+    }


This is untranslatable. Even in English I can barely tell what a final string would look like, so once you compound that with translations that use a different word order, you'll have a very difficult time getting something comprehensible out of this.

This should be split into multiple strings, e.g.:
string-a = From version { $version_added }
string-b = From version { $version_added } until { $versionLast } users
etc.
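
As a rough illustration of the split, the version-range part could be selected in JavaScript, with each id pointing at one complete, independently translatable string. All four ids are hypothetical, and the `hasAdded` / `hasLast` booleans stand in for the `$has_added` / `$has_last` variables in the diff above.

```javascript
/**
 * Chooses one complete message per combination instead of nesting selectors.
 * The ids are hypothetical; none of them exist in the PR.
 * @param {{ hasAdded: boolean, hasLast: boolean }} support
 * @returns {string}
 */
function compatFlagsVersionId({ hasAdded, hasLast }) {
  if (hasAdded && hasLast) {
    return "compat-support-flags-from-until";
  }
  if (hasAdded) {
    return "compat-support-flags-from";
  }
  if (hasLast) {
    return "compat-support-flags-until";
  }
  return "compat-support-flags-always";
}

console.log(compatFlagsVersionId({ hasAdded: true, hasLast: false }));
```

The same approach would apply to the flag type and optional value: more strings in total, but each one reads as a full sentence to a localizer.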

bcolsson · 2025-09-03T00:14:36Z

l10n/template.ftl

+compat-experimental = Experimental
+compat-nonstandard = Non-standard
+compat-no = No
+compat-support-full = Full support


I'm finding this and multiple other strings with the exact same ID. You'll need to find a way to avoid duplication.
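
One way to catch this automatically is a duplicate-id check over the generated template. The sketch below is a line-based approximation, not a full Fluent parser: it assumes every message or term starts at column 0 with `id =`, and the function name `findDuplicateIds` is made up for the example.

```javascript
/**
 * Returns ids that are defined more than once in a Fluent source string.
 * Indented lines (attributes, multiline values) are ignored.
 * @param {string} ftlSource
 * @returns {string[]}
 */
function findDuplicateIds(ftlSource) {
  const counts = new Map();
  for (const line of ftlSource.split("\n")) {
    // Fluent identifiers: a letter, then letters, digits, "_" or "-"; terms start with "-".
    const match = /^(-?[a-zA-Z][a-zA-Z0-9_-]*)\s*=/.exec(line);
    if (match) {
      counts.set(match[1], (counts.get(match[1]) ?? 0) + 1);
    }
  }
  return [...counts].filter(([, count]) => count > 1).map(([id]) => id);
}

const sample = [
  "compat-experimental = Experimental",
  "compat-no = No",
  "compat-experimental = Experimental",
].join("\n");

console.log(findDuplicateIds(sample)); // [ 'compat-experimental' ]
```

A check like this could plausibly run as part of the lint mode, but that wiring is not shown here.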

bcolsson · 2025-09-03T00:17:54Z

l10n/template.ftl

+compat-legend-preview = In development. Supported in a pre-release version.
+compat-legend-no = { compat-support-no }
+compat-legend-unknown = Compatibility unknown
+compat-legend-experimental = { compat-experimental }. Expect behavior to change in the future.


I sort of understand why you want to use a reference here and below, but there is the potential for localization issues here. It'd be simpler for localizers to just have the word instead of a message reference.

LeoMcA · 2026-02-03T17:22:45Z

@bcolsson thanks for your comments. Once we've got the mechanics of our string scraping sorted out in this PR, we'll open others to resolve what you've noted and make some of the auto-generated ids more meaningful.

@caugner I've updated this PR in line with what we discussed a little while ago in Berlin: removing the ability to add strings without defining an id. I temporarily used some of that logic in the migration script to update the current id-less instances to ones with ids.

Commits are mostly atomic - they may not all stand alone, because for some of them I simply split up my previous large commit more logically - but should be much nicer to review than before.

caugner · 2026-02-03T20:25:41Z

package.json

+    "l10n:extract": "node l10n/parser/extract.js",
+    "l10n:lint": "node l10n/parser/extract.js --lint"


How about:

Suggested change

-    "l10n:extract": "node l10n/parser/extract.js",
-    "l10n:lint": "node l10n/parser/extract.js --lint"
+    "l10n": "node l10n/parser/extract.js",
+    "l10n:lint": "node l10n/parser/extract.js --lint"

Rather than extract.js maybe it could be called l10n/cli/index.js?

caugner · 2026-02-03T20:27:24Z

server.js

+          const locale = req.path.split("/")[1];
+          if (locale && /^q[a-t][a-z]$/.test(locale)) {
+            req.path = req.path.replace(locale, "en-US");
+          }


I don't understand what this does. Can you explain it in a code comment?
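
One possible reading, offered as an assumption rather than the author's explanation: `/^q[a-t][a-z]$/` matches the three-letter codes `qaa` to `qtz`, the range ISO 639-2 reserves for local use, which would cover the `qaa` and `qai` pseudo-locales described in this PR's README. A request for such a locale would then be resolved against the `en-US` path. The helper name `rewritePseudoLocalePath` is hypothetical and only isolates the logic so it can be tried out.

```javascript
// ISO 639-2 reserves qaa to qtz for local use; the README in this PR uses
// `qaa` and `qai` as pseudo-locales, both of which fall in that range.
const PRIVATE_USE_LOCALE = /^q[a-t][a-z]$/;

/**
 * Hypothetical helper mirroring the snippet under review: if the first path
 * segment is a private-use locale code, swap it for "en-US".
 * @param {string} path request path, e.g. "/qai/docs/Web"
 * @returns {string}
 */
function rewritePseudoLocalePath(path) {
  const locale = path.split("/")[1];
  if (locale && PRIVATE_USE_LOCALE.test(locale)) {
    // With a string pattern, replace() only touches the first occurrence,
    // which is the locale segment at the start of the path.
    return path.replace(locale, "en-US");
  }
  return path;
}

console.log(rewritePseudoLocalePath("/qai/docs/Web")); // /en-US/docs/Web
console.log(rewritePseudoLocalePath("/fr/docs/Web")); // /fr/docs/Web
```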

caugner · 2026-02-03T20:29:43Z

l10n/template.ftl

Is this intentionally not alphabetically ordered?

caugner · 2026-02-03T20:30:26Z

symmetric-context/both.js

 * Runs on either client or server,
 * and returns the client or server context respectively
- * @returns {import("./types.js").SymmetricContext}
+ * @returns {import("./types.js").SymmetricContext | undefined}


Should the JSDoc comment explain when this is undefined?

caugner · 2026-02-03T20:31:29Z

l10n/parser/transform.js

+ * @import { TextElement } from "@fluent/syntax";
+ */
+
+class AccentTransformer extends Transformer {


Can you add a JSDoc comment explaining what this transformer does?
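
For orientation, the README in this PR describes the `qaa` locale as adding accents, duplicating some vowels, and wrapping strings in square brackets. Assuming `AccentTransformer` is what produces that locale, the effect on a plain piece of text might look roughly like the sketch below. The character table and the rule of doubling every vowel are illustrative simplifications; the real class presumably operates on Fluent `TextElement` nodes so that placeables and markup are left alone.

```javascript
// Illustrative mapping only; the PR's actual character table is not shown here.
const ACCENTS = new Map([
  ["a", "á"], ["e", "é"], ["i", "í"], ["o", "ó"], ["u", "ú"],
  ["A", "Á"], ["E", "É"], ["I", "Í"], ["O", "Ó"], ["U", "Ú"],
  ["c", "ç"], ["n", "ñ"], ["y", "ý"],
]);
const VOWELS = new Set("aeiouAEIOU");

/**
 * Pseudo-localizes plain text: accents characters, doubles vowels to make the
 * string longer, and wraps the result in brackets to reveal truncation.
 * @param {string} text plain text without placeables or HTML
 * @returns {string}
 */
function accentText(text) {
  let out = "";
  for (const char of text) {
    const accented = ACCENTS.get(char) ?? char;
    out += VOWELS.has(char) ? accented + accented : accented;
  }
  return `[${out}]`;
}

console.log(accentText("Page not found"));
```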

caugner · 2026-02-06T17:31:42Z

l10n/parser/extractor.js

+export async function extract() {
+  const manualStrings = await readFile(
+    fileURLToPath(import.meta.resolve("../locales/en-US.ftl")),
+    "utf8",
+  );
+  const fluentResource = parse(manualStrings, {});
+
+  const project = new Project({});
+  project.addSourceFilesAtPaths(
+    path.join(__dirname, "..", "..", "components", "**", "*.js"),
+  );
+
+  /** @type {Map<string, string>} */
+  const map = new Map();
+
+  for (const file of project.getSourceFiles()) {
+    for (const taggedTemplate of file.getDescendantsOfKind(
+      SyntaxKind.TaggedTemplateExpression,
+    )) {
+      const tagNode = taggedTemplate.getTag();
+      if (Node.isCallExpression(tagNode)) {
+        // e.g. this.l10n("foobar")`barfoo`
+        const expr = tagNode.getExpression();
+        if (Node.isPropertyAccessExpression(expr) && isL10nTag(expr)) {
+          const [arg] = tagNode.getArguments();
+          if (Node.isStringLiteral(arg)) {
+            const key = arg.getLiteralValue();
+            const value = getLiteralValue(taggedTemplate);
+            map.set(key, value);
+          }
+        }
+      }
+    }
+  }


Could we split this up a bit, to make some parts of it unit-testable?
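
One way to read this request: keep the ts-morph traversal thin and move whatever happens to each `(key, value)` pair into a pure function that can be tested with plain arrays. The name `buildStringMap` and the conflict reporting are hypothetical additions for the sketch; the code in the diff simply overwrites earlier entries.

```javascript
/**
 * Pure step: turns (id, value) pairs into a map and reports ids that were
 * seen with different values. Needs no file system or AST access.
 * @param {Iterable<[string, string]>} pairs
 * @returns {{ map: Map<string, string>, conflicts: string[] }}
 */
function buildStringMap(pairs) {
  const map = new Map();
  const conflicts = new Set();
  for (const [key, value] of pairs) {
    if (map.has(key) && map.get(key) !== value) {
      conflicts.add(key);
    }
    // Same behavior as the original loop: the last value wins.
    map.set(key, value);
  }
  return { map, conflicts: [...conflicts] };
}

const { map, conflicts } = buildStringMap([
  ["not-found-title", "Page not found"],
  ["compat-no", "No"],
  ["compat-no", "No support"],
]);
console.log(map.size, conflicts);
```

With this split, `extract()` would only be responsible for reading files and yielding pairs, and a unit test could cover the map-building rules without a project on disk.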

caugner · 2026-02-06T17:33:15Z

l10n/locales/.gitignore

Missing newline at EOF.

caugner · 2026-02-06T17:33:21Z

l10n/fluent.js

    let message;

-    if (this.locale === "qa") {
+    if (this.locale === "qai") {


What's qa vs qai?

caugner · 2026-02-06T17:36:16Z

l10n/README.md

+- `qaa`: "accented" locale: adds accents to all characters, duplicates some vowels to create longer strings, wraps string in square brackets to help detect truncation
+- `qai`: "id" locale: replaces strings with their identifiers, wrapped in square brackets
+
+The `qai` locale works all the time, the `qaa` locale must be manually generated with `node ./parser/transform.js`


Maybe make node ./parser/transform.js a script in package.json?

caugner · 2026-02-06T17:41:54Z

l10n/template.ftl

+article-footer-last-modified = This page was last modified on <time data-l10n-name="date">{ $date }</time> by <a data-l10n-name="contributors">MDN contributors</a>.
+article-footer-source-title = Folder: { $folder } (Opens in a new tab)
+baseline-asterisk = Some parts of this feature may have varying levels of support.
+baseline-high-extra = This feature is well established and works across many devices and browser versions. It’s been available across browsers since { $date }.
+baseline-low-extra = Since { $date }, this feature works across the latest devices and browser versions. This feature might not work in older devices or browsers.
+baseline-not-extra = This feature is not Baseline because it does not work in some of the most widely-used browsers.
+baseline-supported-in = Supported in { $browsers }
+baseline-unsupported-in = Not widely supported in { $browsers }
+baseline-supported-and-unsupported-in = Supported in { $supported }, but not widely supported in { $unsupported }
+homepage-hero-title = Resources for Developers,<br> by Developers
+homepage-hero-description = Documenting <a data-l10n-name="css">CSS</a>, <a data-l10n-name="html">HTML</a>, and <a data-l10n-name="js">JavaScript</a>, since 2005.
+not-found-title = Page not found
+not-found-description = Sorry, the page <code data-l10n-name="url">{ $url }</code> could not be found.
+not-found-fallback-english = <strong data-l10n-name="strong">Good news:</strong> The page you requested exists in <em data-l10n-name="em">English</em>.
+not-found-fallback-search = The page you requested doesn't exist, but you could try a site search for:
+not-found-back = Go back to the home page


It would be nice if the extractor detected runs of lines with a common prefix and inserted a blank line between those groups, for readability.
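
A small sketch of that grouping, assuming one single-line message per line and taking "prefix" to mean the first hyphen-separated segment of the id. Both the function name `groupByPrefix` and that definition of prefix are assumptions; the extractor may need something smarter for multiline messages.

```javascript
/**
 * Inserts an empty line whenever the first segment of the message id changes.
 * @param {string[]} lines one "id = value" message per entry
 * @returns {string[]}
 */
function groupByPrefix(lines) {
  const out = [];
  let previousPrefix;
  for (const line of lines) {
    // "baseline-supported-in = ..." -> "baseline"
    const prefix = line.split("=")[0].trim().split("-")[0];
    if (previousPrefix !== undefined && prefix !== previousPrefix) {
      out.push("");
    }
    out.push(line);
    previousPrefix = prefix;
  }
  return out;
}

const grouped = groupByPrefix([
  "baseline-supported-in = Supported in { $browsers }",
  "baseline-unsupported-in = Not widely supported in { $browsers }",
  "not-found-title = Page not found",
  "not-found-back = Go back to the home page",
]);
console.log(grouped.join("\n"));
```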

LeoMcA requested review from a team and mdn-bot as code owners July 24, 2025 18:29

LeoMcA temporarily deployed to review July 24, 2025 18:29 — with GitHub Actions Inactive

argl temporarily deployed to review July 31, 2025 15:36 — with GitHub Actions Inactive

caugner marked this pull request as draft August 19, 2025 11:35

LeoMcA force-pushed the fluent-ast branch from 8265135 to a4dbb0d Compare August 26, 2025 07:56

LeoMcA had a problem deploying to review August 26, 2025 07:56 — with GitHub Actions Failure

LeoMcA force-pushed the fluent-ast branch from a4dbb0d to af8be44 Compare August 26, 2025 08:02

LeoMcA temporarily deployed to review August 26, 2025 08:02 — with GitHub Actions Inactive

LeoMcA force-pushed the fluent-ast branch from af8be44 to b163734 Compare August 26, 2025 08:10

LeoMcA temporarily deployed to review August 26, 2025 08:10 — with GitHub Actions Inactive

LeoMcA force-pushed the fluent-ast branch from b163734 to 2b6298b Compare August 26, 2025 08:17

LeoMcA temporarily deployed to review August 26, 2025 08:17 — with GitHub Actions Inactive

LeoMcA marked this pull request as ready for review August 26, 2025 08:21

LeoMcA requested a review from caugner August 26, 2025 08:21

LeoMcA mentioned this pull request Aug 26, 2025

L10n Review PR #225

Draft

bcolsson reviewed Sep 3, 2025

caugner removed the request for review from a team October 22, 2025 10:20

LeoMcA marked this pull request as draft January 30, 2026 16:54

LeoMcA added 9 commits February 3, 2026 17:08

chore(l10n): move fluent files

40e0960

chore(l10n): load locales on demand

23d5342

chore(l10n): require l10n tags to have an id

82e1a3b

fix(l10n): don't return html type from l10n template tag

f38b6c8

chore(l10n): add string scraper/extractor script

3903375

chore(l10n): add lint mode to extractor

8407f21

chore(l10n): add generated qa locale

3245e89

chore(l10n): add docs

538881a

chore(l10n): script to generate missing ids

f30f28b

run l10n:extract

82dcb10

LeoMcA force-pushed the fluent-ast branch from 2b6298b to 689801c Compare February 3, 2026 17:15

LeoMcA marked this pull request as ready for review February 3, 2026 17:22

LeoMcA requested a review from a team as a code owner February 3, 2026 17:22

LeoMcA added 4 commits February 3, 2026 17:28

run scripts/migrate-l10n/index.js

ea56bde

run prettier

b5974db

run l10n:extract

3dba163

chore(l10n): remove script to generate missing ids

153f4db

LeoMcA force-pushed the fluent-ast branch from 689801c to 153f4db Compare February 3, 2026 17:29

caugner reviewed Feb 6, 2026
