Weitong Qian

Weitong Qian钱韦潼

Undergraduate · Peking University · DAIR Lab

I am an undergraduate (Class of 2027) at Peking University, College of Engineering (COE), advised by Prof. Bin Cui at the PKU-DAIR Lab.

I work on using AI to build AI: developing systems in which AI accelerates, automates, and extends the very process by which AI is created and deployed.

I am interested in turning AI inward on its own stack — the infrastructure it runs on, the algorithms that train it, the research process that produces it, and the physical embodiments through which it acts.

My research spans four layers of this stack:

  1. 01
    AI for AI Infrastructure Agents that write and optimize the code AI runs on.
  2. 02
    AI for AI Algorithms LLM-guided search for ML pipelines and hyperparameters.
  3. 03
    AI for AI Research Agent systems that compress the literature-to-publication loop.
  4. 04
    AI for the Physical World Embodied agents and world models that extend AI's reach beyond the screen.

Leadership & Service/ 01

Publications/ 02

Preprint

LB-MCTS: Synergizing Large Language Models and Bayesian Optimization for Efficient CASH

Beicheng Xu, Weitong Qian, Lingching Tung, Yupeng Lu, Bin Cui

arXiv:2601.12355

Projects/ 03

01AI for AI Infrastructure

GPU Kernel Optimization Agent In progress

An open-source agent system that autonomously writes, profiles, and optimizes CUDA kernels — pushing AI further down its own software stack.

02AI for AI Algorithms

LB-MCTS Preprint

LLM-guided Monte Carlo Tree Search for the Combined Algorithm Selection and Hyperparameter optimization (CASH) problem. We use the LLM as a structural prior over the pipeline space, making Bayesian optimization dramatically more sample-efficient.

03AI for AI Research

AutoSci Open source 968

A memory-centric agent system for the full scientific research lifecycle — from literature ingestion and idea generation to experiment execution, manuscript writing, and reviewer rebuttal. Structured persistent memory separates reusable knowledge from project-level artifacts; a feedback-driven self-improvement loop converts experiment outcomes and review signals into versioned updates to skills and memory, so the system compounds across projects.

FrontierPilot 2nd Prize · Lobster Hackathon

An AI research-onboarding tool, built on the OpenClaw skill platform. Given a topic, it constructs a self-contained, growing knowledge base — field overview, foundational and frontier papers, peer reviews, knowledge graph — that a newcomer can converse with directly.

04AI for the Physical World

Embodied RL @ Galbot Feb – Jun 2025

Vision-based, RL-driven non-prehensile grasping; RL controllers for stable and adaptive locomotion on quadruped and biped robots over varied terrain. Advised by Prof. He Wang.

AI Robot Scientist Experiment System 3rd Prize · Beijing Challenge Cup

A system in which embodied AI agents autonomously plan and conduct scientific experiments — research automation extended into the physical world.

Experience/ 04

PKU-DAIR Lab, Peking University — Undergraduate Researcher Jul 2025 – Present

Working on automating the AI stack across algorithms, research, and infrastructure. Advised by Prof. Bin Cui.

Galbot (北京银河通用机器人) — Algorithm Intern Feb 2025 – Jun 2025

Research on vision-based, RL-driven non-prehensile grasping; RL controllers for legged robots. Advised by Prof. He Wang.

Awards/ 05

Competitions & Recognition

Scholarships & Honors

News/ 06

  1. May 2026 AutoSci preprint released on arXiv.
  2. Apr 2026 Released AutoSci — a wiki-centric AI research lifecycle platform powered by Claude Code.
  3. Mar 2026 FrontierPilot wins 2nd Prize at the Zhongguancun OpenClaw Hackathon (Academic Track).
  4. Jan 2026 LB-MCTS preprint released on arXiv.
  5. Jul 2025 Joined PKU-DAIR Lab.

Friends/ 07