Tim-Siu

Follow

🤩

Shuyao "Tim" Xu Tim-Siu

🤩

Follow

@ NUS

27 followers · 47 following

National University of Singapore
Singapore
tim-siu.github.io

Achievements

Achievements

Highlights

Pro

Pinned Loading

reinforcement-distillation reinforcement-distillation Public

Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"

Python 27
reft-exp reft-exp Public

A research repo for experiments about Reinforcement Finetuning

Python 49 2
MarkBind/markbind MarkBind/markbind Public

MarkBind is a tool for generating content-heavy websites from source files in Markdown format

HTML 148 138
llm-random-prune llm-random-prune Public

ICLR 25 SLLM: The Surprising Effectiveness of Randomness in LLM Pruning

Python 1