Distributed RL System for LLM Reasoning
-
Updated
Jun 13, 2025 - Python
Distributed RL System for LLM Reasoning
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
Awesome RL-based LLM Reasoning
An awesome repository & A comprehensive survey on interpretability of LLM attention heads.
历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.
Ling is a MoE LLM provided and open-sourced by InclusionAI.
Official code for "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.
[ACL'2025 Findings] Official repo for "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task"
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024
🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL
Official code for ACL'25 Main: "Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models"
Add a description, image, and links to the llm-reasoning topic page so that developers can more easily learn about it.
To associate your repository with the llm-reasoning topic, visit your repo's landing page and select "manage topics."