Jue Wang

Hello, I am currently a senior staff researcher at Together AI, working closely with Prof. Ce Zhang. Before that, I received my Ph.D. from Zhejiang University, advised by Prof. Lidan Shou.

My recent research mainly focuses on efficient and cost-effective algorithms and systems for LLMs.

Updates

  • Mar 2026: Congrats to the team on the TorchSpec release – excited to see speculative decoding training scaled up effectively!

  • Jan 2026: We got 3 papers accepted to MLSys and 1 paper accepted to ICLR 2026! Congratulations to the collaborators!

  • Dec 2025: I’m happy to serve as the HPCA 2026 Artifact Evaluation Chair. Have fun in Sydney!

  • May 2025: We got three papers accepted to ICML 2025! Congratulations to the collaborators!

  • Jan 2025: We got three papers accepted to ICLR 2025! Congratulations to the collaborators!

  • Jun 2024: Check out Together MoA, which achieves SoTA results with open-source models only!

  • May 2024: We had a paper accepted to ACL 2024. Congratulations to the collaborators!

  • May 2024: We had a paper accepted to ICML 2024. Congratulations to the collaborators!

  • Sep 2023: We had a paper accepted to NeurIPS.

  • Aug 2023: LLaMA-7B-32K and LLaMA-7B-32K-Instruct have been released.

  • Jun 2023: RedPajama-7B-v1 has been released.

  • Apr 2023: We got two papers accepted to ICML 2023!

  • Mar 2023: OpenChatKit has been released, cheers!

  • Nov 2022: Check out our demo of GPT-JT!

  • Nov 2022: We had a paper accepted to AAAI 2023. Congratulations to the collaborators!

  • Nov 2022: Check out our benchmark on LLMs!

  • Sep 2022: We had a paper accepted to NeurIPS 2022. Congratulations and thanks to all the collaborators!

  • Apr 2022: We got a paper accepted to IJCAI 2022.

  • Mar 2022: I visited ETH Zurich.

  • Feb 2022: As the first author, I had a paper accepted to ACL 2022.

  • Jun 2021: I graduated from CentraleSupélec with a diplôme d’Ingénieur (master's degree), cheers!

  • Dec 2020: As the first author, I had a paper accepted to AAAI 2021.

  • Sep 2020: As the first author, I had a paper accepted to EMNLP 2020.

  • Apr 2020: As the first author, I had a paper accepted to ACL 2020.

Work Experience

  • Together AI, Senior Staff Researcher, May 2025 - Present
  • Together AI, Staff Researcher, July 2023 - May 2025
  • Rokid, Research Intern, Jun 2018 - Sep 2018

Education

  • Zhejiang University, PhD in Computer Science, Sep 2018 - Jun 2023
  • ETH Zurich, Academic Guest, Mar 2021 - Sep 2021
  • Université Paris Saclay (CentraleSupélec), Master (Engineer) in General Engineering, Sep 2016 - Jun 2018
  • Zhejiang University, Bachelor in Electrical Engineering, Sep 2014 - Jun 2018

Publications

Contact

251 Rhode Island St,

Together AI, San Francisco, CA 94103

Email: [email protected]