llm-reasoning

Star

Here are 37 public repositories matching this topic...

inclusionAI / AReaL

Star

Distributed RL System for LLM Reasoning

reinforcement-learning rl machine-learning-systems mlsys llm llm-reasoning

Updated Jun 13, 2025
Python

Gen-Verse / MMaDA

Star

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

diffusion-models llm-reasoning unified-multimodal-understanding-and-generation

Updated Jun 13, 2025
Python

YangLing0818 / buffer-of-thought-llm

Star

[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

large-language-models chain-of-thought-reasoning retrieval-augmented-generation llm-reasoning

Updated Mar 23, 2025
Python

reasoning-survey / Awesome-Reasoning-Foundation-Models

Star

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

reasoning multimodal foundation-models llm reasoning-agent llm-reasoning reasoning-language-models

Updated Apr 25, 2025

bruno686 / Awesome-RL-based-LLM-Reasoning

Star

Awesome RL-based LLM Reasoning

reinforcment-learning llm llm-reasoning rl-based-llm-reasoning

Updated May 4, 2025

IAAR-Shanghai / Awesome-Attention-Heads

Star

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

awesome survey transformer gpt attention-mechanism research-paper circuit-analysis interpretability cognitive-neuroscience visualization-tools large-language-models llm chain-of-thought llm-reasoning machine-psychology attention-head-mining

Updated Mar 2, 2025
TeX

yinizhilian / ICLR2025-Papers-with-Code

Star

历年ICLR论文和开源项目合集，包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.

python machine-learning transformer gpt nlp-machine-learning nlp-keywords-extraction iclr2021 paperwithcode iclr2022 llms iclr2023 llm-agent llm-training gemmini llm-framework iclr2024 llm-reasoning llama3 deep-learning-paper

Updated Mar 14, 2025

inclusionAI / Ling

Star

Ling is a MoE LLM provided and open-sourced by InclusionAI.

machine-learning rl moe llm llm-reasoning

Updated May 14, 2025
Python

mangopy / SearchLM

Star

Official code for "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"

rag large-language-models retrieval-augmented-generation llm-reasoning

Updated May 28, 2025
Python

YangLing0818 / SuperCorrect-llm

Star

[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction

reflection self-correction dpo llm llm-reasoning

Updated Mar 23, 2025
Python

pittisl / PhyT2V

Star

official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation

video-generation diffusion-models prompt-tuning llm-reasoning cvpr2025

Updated Mar 17, 2025
Python

MozerWang / AMPO

Star

[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents

agent large-language-models reasoning-agent llm-reasoning reasoning-language-models long-cot

Updated May 20, 2025
Python

[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.