Artificial Intelligence

Search and explore resources across
governance, risk and AI systems

Computer Science

Boosting LLM Reasoning via Spontaneous Self-Correction

Meta AI, Mila - Quebec AI Institute, Polytechnique Montréal

Computer Science

Accelerated Test-Time Scaling with Model-Free Speculative Sampling

Amazon AGI, KAIST

Computer Science

ss-Mamba: Semantic-Spline Selective State-Space Model

National Chengchi University

Computer Science

DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models

Cornell University, Johns Hopkins University, Touro University College of Osteopathic Medicine

Computer Science

RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation

East China Normal University, Meituan Inc., Donghua University, Tsinghua University

Computer Science

Skywork Open Reasoner 1 Technical Report

Skywork AI, Kunlun Inc.

Computer Science

LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models

University of Bielefeld, University of Mannheim, University of Technology Nuremberg

Computer Science

In-Context Watermarks for Large Language Models

UC Berkeley, UC Santa Barbara, University of Florida

Computer Science

Breaking Down Video LLM Benchmarks: Knowledge, Spatial Perception, or True Temporal Understanding?

Apple

Computer Science

JULI: Jailbreak Large Language Models by Self-Introspection

Wuhan University, University of California, Berkeley

Computer Science

Qwen3 Technical Report

Qwen Team

Computer Science

Evaluating LLM Metrics Through Real-World Capabilities

University of Sydney

Computer Science

R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

Tsinghua University, Stanford University, Carnegie Mellon University, University of Pennsylvania, Tencent Hunyuan X, Fitten

Computer Science

VideoLLM Benchmarks and Evaluation: A Survey

Indian Institute of Technology Jodhpur

Computer Science

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Fudan University, Nanyang Technological University, Singapore Management University, Tsinghua University, Singapore University of Technology and Design, University of California Davis, National University of Singapore, University of Illinois Urbana-Champaign, Australian National University

Computer Science

HalluLens: LLM Hallucination Benchmark

FAIR at Meta, GenAI at Meta, HKUST

Computer Science

Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions

University of Georgia, University of Texas at Arlington, Harvard University, Carnegie Mellon University, Vanderbilt University, Mayo Clinic Arizona, Augusta University

Computer Science

Self-Correction Makes LLMs Better Parsers

Soochow University

Computer Science

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Shanghai AI Laboratory, SenseTime Research, Tsinghua University, Nanjing University, Fudan University, The Chinese University of Hong Kong, Shanghai Jiao Tong University

Computer Science

KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs

University of Southern California, Independent Researcher, University of California, Riverside
