AGI Watch Active
GitHub
N
AI Nerd Hub

The Daily AI Command Center

Real-time model benchmarks, pricing matrices, and community pulses. Caught up in 60 seconds.

🔍
Limit:
Modality:
97.3
#1
OpenAI: GPT-5.5 Pro Logo
OpenAI: GPT-5.5 Pro
97.3
#2
OpenAI: GPT-5.5 Logo
OpenAI: GPT-5.5
97.3
#3
OpenAI: GPT-5.4 Image 2 Logo
OpenAI: GPT-5.4 Image 2
97.3
#4
OpenAI: GPT-5.4 Nano Logo
OpenAI: GPT-5.4 Nano
97.3
#5
OpenAI: GPT-5.4 Mini Logo
OpenAI: GPT-5.4 Mini
97.3
#6
OpenAI: GPT-5.4 Pro Logo
OpenAI: GPT-5.4 Pro
97.3
#7
OpenAI: GPT-5.4 Logo
OpenAI: GPT-5.4
97.3
#8
OpenAI: GPT-5.3 Chat Logo
OpenAI: GPT-5.3 Chat
97.3
#9
OpenAI: GPT-5.3-Codex Logo
OpenAI: GPT-5.3-Codex
97.3
#10
OpenAI: GPT-5.2-Codex Logo
OpenAI: GPT-5.2-Codex
97.3
#11
OpenAI: GPT-5.2 Chat Logo
OpenAI: GPT-5.2 Chat
97.3
#12
OpenAI: GPT-5.2 Pro Logo
OpenAI: GPT-5.2 Pro
97.3
#13
OpenAI: GPT-5.2 Logo
OpenAI: GPT-5.2
97.3
#14
OpenAI: GPT-5.1-Codex-Max Logo
OpenAI: GPT-5.1-Codex-Max
97.3
#15
OpenAI: GPT-5.1 Logo
OpenAI: GPT-5.1
97.3
#16
OpenAI: GPT-5.1 Chat Logo
OpenAI: GPT-5.1 Chat
97.3
#17
OpenAI: GPT-5.1-Codex Logo
OpenAI: GPT-5.1-Codex
97.3
#18
OpenAI: GPT-5.1-Codex-Mini Logo
OpenAI: GPT-5.1-Codex-Mini
97.3
#19
OpenAI: GPT-5 Image Mini Logo
OpenAI: GPT-5 Image Mini
97.3
#20
OpenAI: GPT-5 Image Logo
OpenAI: GPT-5 Image
Showing 20 of 336 modelsShowing text-relevant models only

🔥 This Morning2026-06-30

  • GPT-5 scores 89.2% on MMLU-Pro, retaking the #1 slot on the LMSYS Arena Chatbot leaderboard.
  • DeepSeek releases V3.2 weights, introducing high-performance coder modalities at $0.14 per million input tokens.
  • EU AI Act enforcement phase begins today, requiring developer registries for models exceeding 10^26 FLOPs.
Updated: 06:00 UTC

💬 Social Pulse SentimentAI Synthesized

Scaling laws are hitting a wall43%
Open source is catching up fast31%
Agents are the next platform18%
AI safety concerns are ignored8%

📄 Trending Research Papers (No Abstracts)

Digests →
Program-as-Weights: A Programming Paradigm for Fuzzy FunctionsBy Wentao Zhang, Liliana Hotsko, Woojeong Kim, Pengyu Nie, Stuart Shieber, Yuntian Deng
▲ 68
💬 7
AgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM AgentsBy Xiangchen Cheng, Yunwei Jiang, Jianwen Sun, Zizhen Li, Chuanhao Li, Xiangcheng Cao, Yihao Liu, Fanrui Zhang, Li Jin, Kaipeng Zhang
▲ 43
💬 3
EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive EnvironmentsBy Zhilin Wang, Han Song, Runzhe Zhan, Jusen Du, Jiacheng Chen, Tianle Li, Qingyu Yin, Yulun Wu, Zhennan Shen, Tong Zhu, Yanshu Li, Guanjie Chen, Derek F. Wong, Yafu Li, Yu Cheng, Yang Yang
▲ 41
💬 9
PerceptionRubrics: Calibrating Multimodal Evaluation to Human PerceptionBy Yana Wei, Hongbo Peng, Yanlin Lai, Liang Zhao, Kangheng Lin, En Yu, Keyu Lv, Han Zhou, Yin Tang, Haodong Li, Mitt Huang, Hangyu Guo, Jianjian Sun, Zheng Ge, Xiangyu Zhang, Daxin Jiang, Vishal M. Patel
▲ 38
💬 2
Morphing into Hybrid Attention ModelsBy Disen Lan, Jianbin Zheng, Yuxi Ren, Xin Xia, Xuanda Wang, Xuefeng Xiao, Xipeng Qiu, Yu Cheng
▲ 36
💬 3
Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged SamplingBy Xingyu Zheng, Xianglong Liu, Yifu Ding, Weilun Feng, Junqing Lin, Jinyang Guo, Haotong Qin
▲ 25
💬 2
AgenticDataBench: A Comprehensive Benchmark for Data AgentsBy Zhaoyan Sun, Shan Zhong, Daizhou Wen, Jiaxing Han, Guoliang Li, Ying Yan, Peng Zhang, Yu Su, Xiang Qi, Baolin Sun, Chengyuan Yang, Tao Fang, Huaiyu Ruan
▲ 25
💬 4
ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE ServingBy Sangjin Choi, Sukmin Cho, Yifan Xiong, Ziyue Yang, Youngjin Kwon, Peng Cheng
▲ 24
💬 7
Multimodal Continuous Reasoning via Asymmetric Mutual Variational LearningBy Shijie Li, Yilin Gao, Siyuan Yang, Tieyuan Chen, Chaofan Gan, Zhihao He, Zicheng Zhao, Yuyu Guo, Weiyao Lin, Hang Yu
▲ 22
💬 7
MemSyco-Bench: Benchmarking Sycophancy in Agent MemoryBy Zhishang Xiang, Zerui Chen, Yunbo Tang, Zhimin Wei, Ruqin Ning, Yujie Lin, Qinggang Zhang, Jinsong Su
▲ 22
💬 6
WorldDirector: Building Controllable World Simulators with Persistent Dynamic MemoryBy Hanlin Wang, Hao Ouyang, Qiuyu Wang, Wen Wang, Qingyan Bai, Ka Leong Cheng, Yue Yu, Yixuan Li, Yihao Meng, Zichen Liu, Yanhong Zeng, Yujun Shen, Qifeng Chen
▲ 20
💬 7
ASPIRE: Agentic /Skills Discovery for RoboticsBy Runyu Lu, Yubo Wu, Ethan Kou, Letian Fu, Wenli Xiao, Ajay Mandlekar, Yinzhen Xu, Guanya Shi, Ken Goldberg, Ang Chen, Mosharaf Chowdhury, Yuke Zhu, Linxi "Jim" Fan, Guanzhi Wang
▲ 18
💬 8
Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal ReasoningBy Junha Jung, Minbyul Jeong, Suhyeon Lim, Sungwook Jung, Jaehoon Yun, Taeyun Roh, Mujeen Sung, Jaewoo Kang
▲ 17
💬 2
Logit-Contribution Scoring Identifies Non-Literal Retrieval HeadsBy Aryo Pradipta Gema, Beatrice Alex, Pasquale Minervini
▲ 14
💬 3
AGI Watch:~0y 0d 0h 0m 0s
85% ProbTimeline →

🤖 Top AI Agents

View Directory →
#1Cursorcoding
$20/month★ 4.8
#2AutoGPTautonomous
Free/OSS★ 3.8
#3Aidercoding
Free/OSS (BYOK)★ 4.6

🛠️ Top Tools and Packages

View Directory →
#1Ollamalocal-execution
175.4k★+1200
#2vLLMinference-engine
85.3k★+320
#3Unslothfine-tuning
67.8k★+890