    Repositories list

    • Ressys benchmark code repo
      Python
      0300 Updated Sep 26, 2025
    • Python
      0700 Updated Sep 14, 2025
    • Python
      01710 Updated Jul 29, 2025
    • Official repository for Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents [EMNLP 2025]
      Python
      1500 Updated Jul 24, 2025
    • AutoRule (Public)
      Official repository for AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning
      Python
      0810 Updated Jul 24, 2025
    • Official repository for FactMM-RAG: Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [NAACL 2025]
      Python
      11210 Updated Jul 12, 2025
    • Python
      0010 Updated May 30, 2025
    • Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
      Jupyter Notebook
      4000 Updated May 2, 2025
    • Python
      0210 Updated Apr 2, 2025
    • Interpret and control dense embedding via sparse autoencoder.
      Python
      0600 Updated Mar 5, 2025
    • Craw4LLM (Public)
      Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
      Python
      5663740 Updated Feb 24, 2025
    • Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
      Python
      44810 Updated Jan 24, 2025
    • RAGViz (Public)
      Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
      TypeScript
      138510 Updated Jan 18, 2025
    • MATES (Public)
      Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
      Python
      97440 Updated Nov 14, 2024
    • esae (Public)
      Python
      0000 Updated Oct 29, 2024
    • Python
      0100 Updated Oct 23, 2024
    • Python
      1800 Updated Aug 23, 2024
    • Python
      0300 Updated Jun 20, 2024