Jasaxion一只大雄

The wind strikes, shattering the glazed glass; yet unbroken remains the sunlight spilling across the earth.


Home
Zhan Shaoxiong
📫 Email
LinkedIn
GitHub
Twitter/X

🙋‍ Zhan Shaoxiong (詹少雄)
📧 zhansx24@mails.tsinghua.edu.cn / jasaxion@gmail.com
📍 Shenzhen, China · Tsinghua SIGS

I am a master's student at the Knowledge Engineering Lab of Tsinghua Shenzhen International Graduate School, advised by Prof. Haitao Zheng. My research focuses on natural language processing (NLP), information retrieval (IR), and large language models (LLMs), with particular interest in dense retrieval, semantic augmentation, and retrieval-augmented generation. I received a B.Eng. degree in Computer Science from Huazhong Agricultural University.


🔥 News

• 2025.11 — MathSmith accepted to AAAI’26 🎉
• 2025.11 — IntentionChain accepted to AAAI’26 🎉
• 2025.10 — HSPIM accepted to Information Sciences 🎉
• 2025.07 — LexSemBridge accepted to ECAI’25 🎉
• 2025.06 — IRSC Benchmark accepted to NLPCC’25 🎉
• 2025.02 — QAEA-DR accepted to IEEE TKDE 🎉

📝 Publications

MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy
(AAAI, 2026) · arXiv:2508.05592
IntentionChain-of-Thought Prompting with Dynamic Routing for Code Generation
(AAAI, 2026)(co-author)
A Hierarchical Framework for Measuring Scientific Paper Innovation via Large Language Models
(Information Sciences, 2025)(co-author) · arXiv:2504.14620
LexSemBridge: Fine-Grained Dense Representation Enhancement through Token-Aware Embedding Augmentation
(ECAI 2025) · arXiv:2508.17858
IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios
(NLPCC 2025)(co-author) · arXiv:2409.15763
QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval
(IEEE Transactions on Knowledge and Data Engineering, TKDE 2025)(co-author) · DOI: 10.1109/TKDE.2025.3543203

🎖 Honors and Awards

• National Scholarship in China (×3): 2021, 2022, 2023
• Gratitude Scholarship for Chinese Scientists, 2023
• MCM Finalist (Top 1%), 2023
• China University Big Data Challenge National Second Prize, 2022
• MathorCup National Second Prize, 2022

💻 Internships

SenseTime Research – LLM Algorithm Intern, Model Foundation Group, Shenzhen
Mar–Sep 2025
  • Focused on post-training optimization of large language models for mathematical reasoning.
  • Designed RL-based hard-problem synthesis strategies and built a large-scale “MathSmith” dataset to improve reasoning robustness.
  • Converted research into a first-author publication, integrating data synthesis and fine-tuning experiments.
XinDan Tech – RAG Algorithm Intern, Enterprise LLM System, Shenzhen
Dec 2023–Mar 2024
  • Developed enterprise-grade Retrieval-Augmented Generation (RAG) pipelines with LangChain and embedding fine-tuning.
  • Implemented OCR + multimodal fusion for document understanding and contributed to system design and delivery.
Huawei MindSpore & ISCAS – Open Source Intern, Online
Aug–Nov 2023
  • Built an AST-based PyTorch→MindSpore conversion module, enhancing framework interoperability and automation.
  • Strengthened understanding of AI framework architecture and open-source collaboration workflows.

🎨 Miscellaneous

👋 I'm a hands-on tech enthusiast who enjoys tinkering with gadgets and experimenting with new ideas 💡, even if things break along the way. Outside of work, you'll find me playing badminton 🏸, swimming 🏊‍♀️, or dancing 💃 (hip-hop/K-pop). Always happy to connect; feel free to add me on WeChat: Jasaxion_Taurus0405 🤝

👥 Student Leadership

• Head of the Youth Media Center at HZAU, 2022–2023
• Deputy Party Branch Secretary at HZAU, 2023–2024
• Class President at HZAU, 2023–2024