⚠ This page is served via a proxy. Original site: https://github.com
This service does not collect credentials or authentication data.
Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.3k 696

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.4k 163

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.6k 267

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 16.8k 1.3k

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 958 91

Repositories

Showing 10 of 542 repositories
  • S2AND Public

    Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

    allenai/S2AND’s past year of commit activity
    Python 102 20 5 1 Updated Jan 26, 2026
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 743 Apache-2.0 135 7 47 Updated Jan 26, 2026
  • datamap-rs Public

    Data mapping framework for rust stuff

    allenai/datamap-rs’s past year of commit activity
    Rust 44 Apache-2.0 4 0 2 Updated Jan 26, 2026
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,550 Apache-2.0 489 13 (1 issue needs help) 45 Updated Jan 26, 2026
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 64 Apache-2.0 11 1 32 Updated Jan 26, 2026
  • olmoearth_pretrain Public

    Earth system foundation model data, training, and eval

    allenai/olmoearth_pretrain’s past year of commit activity
    Python 129 23 2 18 Updated Jan 25, 2026
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    allenai/rslearn’s past year of commit activity
    Python 70 Apache-2.0 12 28 7 Updated Jan 25, 2026
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 16,806 Apache-2.0 1,334 39 17 Updated Jan 25, 2026
  • beaker-gantry Public

    Gantry is a CLI that streamlines running experiments in Beaker

    allenai/beaker-gantry’s past year of commit activity
    Python 32 Apache-2.0 7 2 2 Updated Jan 24, 2026
  • olmoearth_projects Public

    OlmoEarth projects

    allenai/olmoearth_projects’s past year of commit activity
    Python 55 11 12 5 Updated Jan 24, 2026