Skip to content

Collect database with most used langauges #558

@vitonsky

Description

@vitonsky

Let's collect database of most used languages in machine readable format with next structure

  • ISO language code
  • total speakers count
  • native speakers count

Later we could use this database to prioritize search for volunteers to maintain a locale files. Or to automate maintaining with LLM.

Current goal is to find a source of information about languages usage. We need list with at least 200 languages.
Information about methodology of data extraction is mandatory. Otherwise this data would be useless, since may be randomly generated.

Some links (mostly useless garbage sites with pay walls and no info about methodology)

Metadata

Metadata

Assignees

No one assigned

    Labels

    devDevelopment and infrastructure taskshelp wantedExtra attention is needed

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions