jingmingzhuo

Follow

Jingming Zhuo jingmingzhuo

Follow

10 followers · 15 following

https://jingmingzhuo.github.io/

Achievements

Achievements

Highlights

Pro

Stars

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 12,520 1,364 Updated Mar 27, 2025

open-compass / Creation-MMBench

Assessing Context-Aware Creative Intelligence in MLLMs

JavaScript 11 Updated Mar 26, 2025

xlang-ai / BRIGHT

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Python 92 9 Updated Feb 12, 2025

StigLidu / TURN

Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"

Python 15 Updated Feb 16, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,072 710 Updated Mar 6, 2025

CaraJ7 / MME-CoT

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 92 2 Updated Mar 21, 2025

GAIR-NLP / LIMO

LIMO: Less is More for Reasoning

Python 872 39 Updated Feb 24, 2025

Quehry / HelloBench

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Python 39 1 Updated Nov 26, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,730 566 Updated Mar 26, 2025

RAGEN-AI / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,244 87 Updated Mar 26, 2025

deepseek-ai / DeepSeek-V3

Python 93,880 15,225 Updated Mar 16, 2025

deepseek-ai / DeepSeek-R1

87,558 11,307 Updated Feb 24, 2025

THUDM / LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python 1,631 160 Updated Oct 29, 2024

OSU-NLP-Group / UGround

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 193 12 Updated Mar 24, 2025

xinyan-cxy / EmpathyAgent

Python 7 Updated Mar 18, 2025

AkariAsai / OpenScholar

This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.

Python 659 64 Updated Feb 7, 2025

WayneJin0918 / SOTA-paper-rating.io

A tiny paper rating web

HTML 36 Updated Mar 19, 2025

PlusLabNLP / VISCO

[CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning

Python 10 1 Updated Mar 1, 2025

sotopia-lab / sotopia

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 202 28 Updated Mar 26, 2025

metauto-ai / agent-as-a-judge

🤠 Agent-as-a-Judge and DevAI dataset

Python 384 51 Updated Jan 20, 2025

stanford-oval / storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 23,619 2,062 Updated Jan 23, 2025

ShengranHu / ADAS

[ICLR 2025] Automated Design of Agentic Systems

Python 1,231 183 Updated Jan 28, 2025

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 51,352 5,679 Updated Mar 27, 2025

TIGER-AI-Lab / MEGA-Bench

This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]

Python 60 6 Updated Mar 26, 2025

web-arena-x / webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 932 143 Updated Feb 7, 2025

open-compass / ProSA

[EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs

Python 24 2 Updated Oct 22, 2024

open-compass / CompassJudger

90 5 Updated Feb 25, 2025

xlang-ai / OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,719 204 Updated Mar 6, 2025

microsoft / PhiCookBook

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 3,118 388 Updated Mar 26, 2025

xinyan-cxy / IPR-RLDF

Python 8 Updated Jan 1, 2025