Zexin Jane

Beijing Haidian, China

👋 Welcome! I am a Ph.D. Candidate in Computer Science at Institute of Computing Technology, Chinese Academy of Sciences (ICT, CAS), where I conduct research in the PL-System Research Group under the supervision of Dr. Huimin Cui and Dr. Chenxi Wang. Prior to that, I received my B.S. in Computer Science, from National University of Defense Technology (NUDT), , advised by Dr.Zengkun.

🤔 My research broadly covers computer architecture, compilers, heterogeneous computing, distributed systems, and deep learning. My current research interests include two main areas: (1) building system software using language models, and (2) creating scalable and cost-effective infrastructures for AI applications:

Performant and cost-effective infrastructures for AI applications: I work to design and implement Deep Learning (DL) Compilers and frameworks to improve the performance and energy efficiency of emerging AI applications on heterogeneous computing platforms.
LLM: CTG (ACL’23), Semi-Open-Ended QA (NIPS’24)

📫 If you are seeking any form of academic cooperation, please feel free to email me at zexinjian@gmail.com.

education

the Institute of Computing Technology, Chinese Academy of Sciences (ICT, CAS)

ph.D. in Computer Science

Sep.2025 - present

advised by Dr. Huimin Cui and Dr. Chenxi Wang

National University of Defense Technology (NUDT)

B.S. in Computer Science

Sep.2021 - Jul.2025

advised by Dr. Zengkun

experience

Advanced Institute of Information Technology (AIIT), Peking University

Research Intern

May.2025 - present

the Institute of Computing Technology, Chinese Academy of Sciences (ICT, CAS)

Visiting Student in Computer Science

Jun.2024 - Sep.2025

State Key Laboratory of High-Performance Computing (HPCL), NUDT

Research Intern

Sep.2022 - Seq.2024

selected publications

NeurIPS24
Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering

Zhihua Wen, Zhiliang Tian, Zexin Jian, and 5 more authors

In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024

Abs Bib PDF

"Large Language Models (LLMs) are widely used for knowledge-seeking purposes yet suffer from hallucinations. The knowledge boundary of an LLM limits its factual understanding, beyond which it may begin to hallucinate. Investigating the perception of LLMs’knowledge boundary is crucial for detecting hallucinations and LLMs’ reliable generation. Current studies perceive LLMs’ knowledge boundary on questions with a concrete answer (close-ended questions) while paying limited attention to semi-open-ended questions that correspond to many potential answers. Some researchers achieve it by judging whether the question is answerable or not. However, this paradigm is not so suitable for semi-open-ended questions, which are usually partially answerable questions containing both answerable answers and ambiguous (unanswerable) answers. Ambiguous answers are essential for knowledge-seeking, but they may go beyond the knowledge boundary of LLMs. In this paper, we perceive the LLMs’ knowledge boundary with semi-open-ended questions by discovering more ambiguous answers. First, we apply an LLM-based approach to construct semi-open-ended questions and obtain answers from a target LLM. Unfortunately, the output probabilities of mainstream black-box LLMs are inaccessible to sample for low-probability ambiguous answers. Therefore, we apply an open-sourced auxiliary model to explore ambiguous answers for the target LLM. We calculate the nearest semantic representation for existing answers to estimate their probabilities, with which we reduce the generation probability of high-probability existing answers to achieve a more effective generation. Finally, we compare the results from the RAG-based evaluation and LLM self-evaluation to categorize four types of ambiguous answers that are beyond the knowledge boundary of the target LLM. Following our method, we construct a dataset to perceive the knowledge boundary for GPT-4. We find that GPT-4 performs poorly on semi-open-ended questions and is often unaware of its knowledge boundary. Besides, our auxiliary model, LLaMA-2-13B, is effective in discovering many ambiguous answers, including correct answers neglected by GPT-4 and delusive wrong answers GPT-4 struggles to identify. "
@inproceedings{LLMBoundary-Nips24, title = {Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering}, author = {Wen, Zhihua and Tian, Zhiliang and Jian, Zexin and Huang, Zhen and Ke, Pei and Gao, Yifu and Huang, Minlie and Li, Dongsheng}, booktitle = {The Thirty-eighth Annual Conference on Neural Information Processing Systems}, year = {2024}, }
ACL-finding23
GRACE: gradient-guided controllable retrieval for augmenting attribute-based text generation

Zhihua Wen, Zhiliang Tian, Zhen Huang, and 4 more authors

In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023

Abs Bib PDF

"Attribute-based generation methods are of growing significance in controlling the generation of large pre-trained language models (PLMs). Existing studies control the generation by (1) finetuning the model with attributes or (2) guiding the inference processing toward control signals while freezing the PLM. However, finetuning approaches infuse domain bias into generation, making it hard to generate out-of-domain texts. Besides, many methods guide the inference in its word-by-word generation, pushing the word probability to the target attributes, resulting in less fluent sentences. We argue that distilling controlling information from natural texts can produce fluent sentences while maintaining high controllability. In this paper, we propose GRAdient-guided Controllable rEtrieval (GRACE), a retrieval-augmented generation framework to facilitate the generation of fluent sentences with high attribute relevance. GRACE memorizes the semantic and attribute information from unlabeled corpora and applies a controllable retrieval to obtain desired information. For the generation, we design techniques to eliminate the domain bias from the retrieval results and integrate it into the generation model. Additionally, we propose a gradient-guided generation scheme that iteratively steers generation toward higher attribute relevance. Experimental results and quantities of examples verify the effectiveness of our method."
@inproceedings{GRACE-ACL23, title = {GRACE: gradient-guided controllable retrieval for augmenting attribute-based text generation}, author = {Wen, Zhihua and Tian, Zhiliang and Huang, Zhen and Yang, Yuxin and Jian, Zexin and Wang, Changjian and Li, Dongsheng}, booktitle = {Findings of the Association for Computational Linguistics: ACL 2023}, month = jul, publisher = {"Association for Computational Linguistics"}, url = {"https://aclanthology.org/2023.findings-acl.530/"}, pages = {8377--8398}, year = {2023}, }