SMART unlocks latent multi-vector retrieval from frozen single-vector models as a plug-and-play upgrade; AutoResearch AI surveys the full spectrum of AI-powered scientific workflow automation; Tencent Hy-MT2 translation models and ByteDance Lance multimodal generator dominate HuggingFace trending; AI coding agent tooling consolidation accelerates with ECC (192K stars), andrej-karpathy-skills (155K stars), and Understand-Anything (31K stars) leading GitHub

AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery

High Relevance

Guiyao Tie, Jiawen Shi, Dingjie Song, Yixiao Huang, Ziji Sheng et al. — Lehigh University, University of Illinois Chicago, Salesforce Research, Microsoft Research, Stanford University

A comprehensive survey examining AI-powered scientific workflow automation (AutoResearch), spanning the full spectrum from human-steered 'Vibe Research' through mixed-initiative co-research to emerging AI-led systems. Organizes the field around five workflow conditions and proposes five evaluation dimensions — novelty, validity, impact, reliability, and provenance — showing that autonomy credibility is domain-conditioned.

Key Findings

•
Introduces the AutoResearch framework distinguishing Vibe Research (human-steered) from AI-led systems across the autonomy spectrum
•
Proposes five evaluation dimensions (novelty, validity, impact, reliability, provenance) for comparing research automation systems
•
Shows AI research autonomy is domain-conditioned: more credible in structured, executable settings but limited in embodied or ethical contexts

ai-for-scienceresearch-automationsurveyai-agentsscientific-discovery

4 upvotes

Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion

Ting-Hsuan Chen, Ying-Huan Chen, Tao Tu, Jie-Ying Lee, Cho-Ying Wu — National Taiwan University, University of Southern California

Pantheon360 addresses digital twin generation through 360° video diffusion rather than perspective video generation. Panoramic coverage simplifies trajectory design and provides strong geometric priors that mitigate cross-view inconsistency and temporal drift — persistent problems when narrow field-of-view generators must stitch together long or multi-view trajectories.

Key Findings

•
360° video generation provides natural panoramic coverage that eliminates the cross-view inconsistency of narrow-FoV perspective generators
•
Panoramic priors simplify trajectory design for complete scene coverage in digital twin generation
•
3D-aware 360° diffusion maintains strict spatial-temporal consistency that perspective approaches struggle with

360-videodigital-twinsvideo-diffusion3d-generationscene-generation

1 upvotes

MetaphorVU: Towards Metaphorical Video Understanding

Zhuoqun Li, Boxi Cao, Guiping Jiang, Fangrui Lv, Ruotong Pan — Peking University, Chinese Academy of Sciences

MetaphorVU-Bench introduces the first systematic benchmark for metaphorical video understanding, targeting high-order cognitive capabilities that standard video benchmarks do not assess. Metaphorical videos are prevalent in advertising, film, and social media, but MLLMs have not been systematically evaluated on their ability to interpret figurative meaning in video.

Key Findings

•
First systematic and comprehensive benchmark dedicated to metaphorical video understanding across multiple cognitive dimensions
•
Reveals significant gaps in current MLLMs' ability to interpret figurative and metaphorical meaning in video content
•
Covers real-world scenarios including advertising, film, and social media where metaphorical communication is prevalent

video-understandingmetaphorbenchmarkmllmcognitive-reasoning

1 upvotes

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

High Relevance

Yifan Yang, Ziyang Gong, Weiquan Huang, Qihao Yang, Ziwei Zhou et al. — Microsoft Research, Fudan University

SkillOpt is the first systematic controllable text-space optimizer for agent skills, treating skills as external trainable agent state with stable updates and zero deployment inference overhead. It frames skill evolution with the discipline of weight-space gradient descent — using rollouts as gradient signal, add/delete/replace edits as parameter updates, and a textual learning-rate budget for stability. Achieves +23.5 accuracy points on GPT-5.5 in direct chat and +19.1 in Claude Code across six benchmarks.

Key Findings

•
First systematic text-space optimizer for agent skills, framing skill text as a trainable external parameter with gradient-descent-style updates
•
Achieves +23.5 accuracy on GPT-5.5 direct chat, +24.8 in Codex, and +19.1 in Claude Code across six benchmarks and seven target models
•
Zero deployment inference overhead — skills are optimized offline and applied as static context at inference time

agent-skillstext-optimizationllm-agentsskill-evolutionclaude-code

159 upvotes

Trending Models (12)

DeepSeek-V4-Pro

DeepSeek · text-generation · unknown

DeepSeek's flagship large language model with state-of-the-art performance on reasoning and coding tasks. Continues to dominate the open-weight model landscape with massive adoption.

conversationalreasoningcoding

4.8M downloads4.3K likes

Anima

Circlestone Labs · image-generation · unknown

Diffusion-based image generation model with strong community adoption and ComfyUI integration, targeting high-quality visual content creation.

diffusioncomfyuiimage-generation

651.7K downloads1.5K likes

Sulphur-2-base

SulphurAI · text-to-video · unknown

Leading open text-to-video generation model with massive download volume, available in both diffusers and GGUF formats for broad deployment flexibility.

text-to-videodiffusersgguf

1.4M downloads1.4K likes

MiniCPM-V-4.6

OpenBMB · image-text-to-text · unknown

Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive

Efficient multimodal vision-language model combining strong image-text understanding with compact parameter count, enabling on-device and edge deployments.

multimodalvision-languageefficient

285.4K downloads943 likes

HauhauCS (Community) · text-generation · 35B (3B active)

Community-tuned uncensored variant of the Qwen 3.6 35B MoE model with aggressive tuning for unrestricted text generation and vision tasks.

moeuncensoredvisionqwen3.6

1.4M downloads858 likes

Hy-MT2-1.8B

Tencent · translation · 1.8B

Tencent's compact machine translation model from the Hunyuan family, offering efficient multilingual translation with strong early community reception (823 likes).

translationhunyuanmultilingual

5.6K downloads823 likes

Lance

ByteDance Research · image-generation · unknown

Multimodal generation model supporting both image and video generation tasks, representing ByteDance's entry into open-weight multimodal content creation.

multimodalimage-generationvideo-generation

1.7K downloads820 likes

supertonic-3

Supertone · text-to-speech · unknown

Advanced text-to-speech model with ONNX support for efficient inference, delivering high-quality speech synthesis for production deployment.

ttsspeech-synthesisonnx

45.8K downloads675 likes

Qwen3.6-27B-MTP-GGUF

Unsloth · text-generation · 27B

Quantized GGUF version of Qwen 3.6 27B by Unsloth, optimized for local inference with multi-token prediction support.

ggufquantizedunslothqwen

695.3K downloads480 likes

Hy-MT2-30B-A3B

Tencent · translation · 30B (3B active)

Tencent's mixture-of-experts translation model with 30B total parameters but only 3B active per token, offering large-model translation quality at small-model inference cost.

translationmoehunyuan

1.5K downloads327 likes

HRM-Text-1B

Sapient Inc · text-generation · 1B

Command A+ (May 2026, W4A4)

Compact 1B parameter text generation model with notably high download volume relative to its size, suggesting strong adoption for lightweight deployment scenarios.

text-generationcompacthrm

90.0K downloads316 likes

Cohere Labs · image-text-to-text · unknown

Lum1104/Understand-Anything

Aggressively quantized (W4A4) version of Cohere's Command A+ vision-language model, enabling efficient deployment of a flagship multimodal model with 4-bit weights and activations.

quantizedvision-languageconversational

7.4K downloads200 likes

Trending GitHub Repos (14)

High RelevanceGitHub

Converts any codebase into an interactive knowledge graph for exploration, search, and natural language Q&A. Compatible with Claude Code, Codex, Cursor, Copilot, and Gemini CLI. Highest star velocity of the day at 5,604 stars today.

knowledge-graphcode-understandingai-coding-agent

TypeScript31.4K+5.6K today2.6K

colbymchenry/codegraph

High RelevanceGitHub

Pre-indexed code knowledge graph for Claude Code, Codex, Cursor, OpenCode, and Hermes Agent. Reduces token usage and tool calls by providing structured codebase context. 100% local execution.

knowledge-graphcode-contextai-coding-agentlocal

TypeScript25.3K+3.2K today1.4K

rohitg00/ai-engineering-from-scratch

High RelevanceGitHub

Comprehensive AI engineering curriculum covering the full stack from fundamentals to production deployment. Extremely high star velocity (3,154 stars today) reflects surging demand for structured AI engineering education.

educationai-engineeringcurriculum

Python18.7K+3.2K today3.2K

multica-ai/andrej-karpathy-skills

High RelevanceGitHub

A single CLAUDE.md file encoding behavioral heuristics for Claude Code, derived from Andrej Karpathy's observations on LLM coding pitfalls. 155K stars and 2,749 stars today indicate massive community adoption.

ai-coding-agentclaude-codeskillsbehavioral-alignment

155.1K+2.7K today15.9K

affaan-m/ECC

High RelevanceGitHub

Comprehensive agent harness performance optimization system with skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond. The largest repo in this cluster at 192K stars.

agent-harnessai-coding-agentskillsmemory

JavaScript192.5K+2.0K today29.8K

anthropics/knowledge-work-plugins

High RelevanceGitHub

Anthropic's open-source repository of plugins for knowledge workers to use in Claude Cowork. Official Anthropic release with strong first-day traction (1,441 stars today).

anthropicclaudepluginsknowledge-work

Python15.6K+1.4K today1.9K

mukul975/Anthropic-Cybersecurity-Skills

High RelevanceGitHub

754 structured cybersecurity skills for AI agents mapped to 5 frameworks (MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND, NIST AI RMF). Works with Claude Code, Copilot, Codex, Cursor, Gemini CLI and 20+ platforms.

cybersecurityai-agentsmitrenistskills

Python9.3K+1.0K today1.1K

garrytan/gstack

Garry Tan's Claude Code setup with 23 opinionated tools serving as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA roles. Strong endorsement signal for AI-assisted product development workflows.

claude-codeai-coding-agentworkflowtools

TypeScript102.5K+640 today15.3K

manaflow-ai/cmux

Ghostty-based macOS terminal with vertical tabs and notifications designed specifically for AI coding agents. Purpose-built terminal infrastructure for agent workflows.

terminalmacosai-coding-agentdeveloper-tools

Swift19.5K+603 today1.5K

666ghj/MiroFish

st-tech/ppf-contact-solver

Universal swarm intelligence engine for prediction tasks. Uses collective intelligence algorithms for general-purpose forecasting across domains.

swarm-intelligencepredictionforecasting

Python62.5K+534 today9.8K

microsoft/agent-governance-toolkit

Physics-based contact solver for simulations involving shells, solids and rods. Gains 432 stars today, reflecting growing interest in differentiable physics simulation tooling.

physics-simulationcontact-solverdifferentiable-physics

Python3.2K+432 today230

High RelevanceGitHub

Microsoft's AI Agent Governance Toolkit providing policy enforcement, zero-trust identity, execution sandboxing, and reliability engineering for autonomous AI agents. Covers all 10 OWASP Agentic Top 10 risks.

agent-governancesecurityzero-trustowaspmicrosoft

Python2.3K+271 today394

shiyu-coder/Kronos

Foundation model for the language of financial markets. Applies transformer architecture to financial time series and market data for prediction and analysis.

financial-aifoundation-modeltime-seriesmarkets

Python26.0K+245 today4.5K

sansan0/TrendRadar