Liu Yifeng

NLP & AI4Science Researcher

I am a student researching on the field of NLP and AI4Science. Python, C++ and LaTeX are my main program and text-generation languages. I have participated in many projects of AI. And I am out-going. See my Blog or Github.

My Work

CodeGeeX: A Code Generative Model for Multilingual Program Synthesis


Provider the idea of soft-score hierarchy for training;

KDD '23 paper: CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X.

See CodeGeex Blog for more details.

My Projects

portfolio img

TPA

Tensor Product Attention Is All You Need

portfolio img

MARS

MARS: Unleashing the Power of Variance Reduction for Training Large Models

portfolio img

T-Rex

A Text-assisted Retrosynthesis Prediction Architecture

portfolio img

Capricorn

Enhancing Hi-C contact matrices for loop detection with a multi-view diffusion approach. (ISMB 2024 & Bioinformatics)

portfolio img

CodeGeeX

A large-scale multilingual code generative model with 13 billion parameters, pre-trained on a large code corpus of more than 20 programming languages.

portfolio img

Medical Multiomics

One non-AI and three AI methods of medical multiomics integration for COVID-19, ZIKV and TNBC datasets.

portfolio img

Heady Liar

A program for mugshot process based on DLIB and PyQt6.

portfolio img

Japan Mahjong

An offline game program with GUI based on C++, including most of the common service species of Japanese mahjong

portfolio img

Miku Crawler

A crawler project based on selenium crawling the data of some Vocaloid videos on BiliBili and generates a data query website using Django database.

portfolio img

Chinese Stratego

An online military chess (flip chess) system based on Qt6 based on IP addresses.

portfolio img

ICM Outstanding Award

The outstanding award of Question E (Carbon Sequestration Calculation), MCM/ICM 2022.

My Profile

Name Liu Yifeng(Lewis Yik-fung Lau)

From Fuling, Chongqing, China

Major Computer Science and Engineering

Phone +86 150 2393 8602

Blog lauyikfung.github.io/blog

Some Skills

I can program with C++, Python, MatLab as well as HTML5. And I can use LaTeX for text generation.

For Deep Learning fields such as NLP and AI4Science, I am proficient in modules such as PyTorch, Numpy and Transformers.

I have learned some about linguistics. Moreover, I am good at Chinese and English, and I can recognize a little of Japanese.

C++ 90%

Python 85%

Chinese 100%

English 90%

Japanese 40%

This is me

NLP & AI4Science Researcher

I am currently a Ph.D. student at UCLA, researching on optimization and architectures of LLMs. I was previously a Yao Class Student from Tsinghua University researching on the fields of NLP following Zhilin Yang (Now CEO of Moonshot Inc.) and AI4Science following Sheng Wang as a visiting student in the University of Washington.


I can program well with C++, Python, MatLab, HTML5 as well as LaTeX. And I have participated in lots of projects of AI, especially in the field of NLP.


I has won the gold medal in the 36th CPhO and obtained the Outstanding Award of Interdisciplinary Contest In Modeling(ICM). Also, I has got the comprehensive award of Tsinghua University twice.


I am responsible, for which I was once elected as the monitor of Yao Class. And I am also sunny, optimistic as well as approachable.