Undergraduate · Peking University · DAIR Lab
I am a junior undergraduate at Peking University, College of Engineering (COE), advised by Prof. Bin Cui and Asst. Prof. Xupeng Miao at the PKU-DAIR Lab.
I work on using AI to build AI: developing systems in which AI accelerates, automates, and extends the very process by which AI is created and deployed.
I am interested in turning AI inward on its own stack — the infrastructure it runs on, the algorithms that train it, the research process that produces it, and the physical embodiments through which it acts.
My research spans four layers of this stack:
LB-MCTS: Synergizing Large Language Models and Bayesian Optimization for Efficient CASH
Submitted to ICML 2026 · arXiv:2601.12355
01AI for AI Infrastructure
An open-source agent system that autonomously writes, profiles, and optimizes CUDA kernels — pushing AI further down its own software stack.
02AI for AI Algorithms
LLM-guided Monte Carlo Tree Search for the Combined Algorithm Selection and Hyperparameter optimization (CASH) problem. We use the LLM as a structural prior over the pipeline space, making Bayesian optimization dramatically more sample-efficient.
03AI for AI Research
Karpathy's LLM-Wiki vision, fully realized — a wiki-centric, full-lifecycle AI research platform powered by Claude Code. Twenty skills covering the loop from paper ingestion → idea generation → experiment design → paper writing → reviewer rebuttal. Knowledge compounds; failed experiments become anti-repetition memory.
An AI research-onboarding tool, built on the OpenClaw skill platform. Given a topic, it constructs a self-contained, growing knowledge base — field overview, foundational and frontier papers, peer reviews, knowledge graph — that a newcomer can converse with directly.
04AI for the Physical World
Vision-based, RL-driven non-prehensile grasping; RL controllers for stable and adaptive locomotion on quadruped and biped robots over varied terrain. Advised by Prof. He Wang.
A system in which embodied AI agents autonomously plan and conduct scientific experiments — research automation extended into the physical world.
Working on automating the AI stack across algorithms, research, and infrastructure. Advised by Prof. Bin Cui and Asst. Prof. Xupeng Miao.
Research on vision-based, RL-driven non-prehensile grasping; RL controllers for legged robots. Advised by Prof. He Wang.
Competitions & Recognition
Scholarships & Honors