SGLang is a fast serving framework for large language models and vision language models.

Python 12,520 1,364 Updated Mar 27, 2025

Assessing Context-Aware Creative Intelligence in MLLMs

JavaScript 11 Updated Mar 26, 2025

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Python 92 9 Updated Feb 12, 2025

Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"

Python 15 Updated Feb 16, 2025

s1: Simple test-time scaling

Python 6,072 710 Updated Mar 6, 2025

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 92 2 Updated Mar 21, 2025

LIMO: Less is More for Reasoning

Python 872 39 Updated Feb 24, 2025

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Python 39 1 Updated Nov 26, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,730 566 Updated Mar 26, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,244 87 Updated Mar 26, 2025

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python 1,631 160 Updated Oct 29, 2024

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 193 12 Updated Mar 24, 2025
Python 7 Updated Mar 18, 2025

This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.

Python 659 64 Updated Feb 7, 2025

A tiny paper rating web

HTML 36 Updated Mar 19, 2025

[CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning

Python 10 1 Updated Mar 1, 2025

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 202 28 Updated Mar 26, 2025

🤠 Agent-as-a-Judge and DevAI dataset

Python 384 51 Updated Jan 20, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 23,619 2,062 Updated Jan 23, 2025

[ICLR 2025] Automated Design of Agentic Systems

Python 1,231 183 Updated Jan 28, 2025

🙌 OpenHands: Code Less, Make More

Python 51,352 5,679 Updated Mar 27, 2025

This repo contains the code for "MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]

Python 60 6 Updated Mar 26, 2025

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 932 143 Updated Feb 7, 2025

[EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs

Python 24 2 Updated Oct 22, 2024

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,719 204 Updated Mar 6, 2025

This is a Phi Family of SLMs book for getting started with Phi Models. Phi is a family of open-source AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 3,118 388 Updated Mar 26, 2025
Python 8 Updated Jan 1, 2025