Undergraduate · Peking University · DAIR Lab
I am an undergraduate (Class of 2027) at Peking University, College of Engineering (COE), advised by Prof. Bin Cui at the PKU-DAIR Lab.
I work on using AI to build AI: developing systems in which AI accelerates, automates, and extends the very process by which AI is created and deployed.
I am interested in turning AI inward on its own stack — the infrastructure it runs on, the algorithms that train it, the research process that produces it, and the physical embodiments through which it acts.
My research spans four layers of this stack:
01AI for AI Infrastructure
An open-source agent system that autonomously writes, profiles, and optimizes CUDA kernels — pushing AI further down its own software stack.
02AI for AI Algorithms
LLM-guided Monte Carlo Tree Search for the Combined Algorithm Selection and Hyperparameter optimization (CASH) problem. We use the LLM as a structural prior over the pipeline space, making Bayesian optimization dramatically more sample-efficient.
03AI for AI Research
A memory-centric agent system for the full scientific research lifecycle — from literature ingestion and idea generation to experiment execution, manuscript writing, and reviewer rebuttal. Structured persistent memory separates reusable knowledge from project-level artifacts; a feedback-driven self-improvement loop converts experiment outcomes and review signals into versioned updates to skills and memory, so the system compounds across projects.
An AI research-onboarding tool, built on the OpenClaw skill platform. Given a topic, it constructs a self-contained, growing knowledge base — field overview, foundational and frontier papers, peer reviews, knowledge graph — that a newcomer can converse with directly.
04AI for the Physical World
Vision-based, RL-driven non-prehensile grasping; RL controllers for stable and adaptive locomotion on quadruped and biped robots over varied terrain. Advised by Prof. He Wang.
A system in which embodied AI agents autonomously plan and conduct scientific experiments — research automation extended into the physical world.
Working on automating the AI stack across algorithms, research, and infrastructure. Advised by Prof. Bin Cui.
Research on vision-based, RL-driven non-prehensile grasping; RL controllers for legged robots. Advised by Prof. He Wang.
Competitions & Recognition
Scholarships & Honors