Junxiong Wang

PhD candidate
Department of Computer Science
Cornell University
Email: junxiong AT cs.cornell.edu
Github: https://github.com/jxiw
HuggingFace Models: https://huggingface.co/JunxiongWang

I am applying for a research position focused on language models and systems. Please reach out if you think I am a good candidate!
Research
Currently, I am working on the intersection of language models and systems. Previously, I worked on large-scale data processing systems and distributed computing systems.

Internship
I am incredibly fortunate to have worked on
  • Entity retrieval in the Siri Information Intelligence Knowledge Platform at Apple.
  • Automatic indexing for big data systems in the Data Systems Group at Microsoft Research.

Publication
Efficient Language Model
  • Junxiong Wang*, Daniele Paliotta*, Avner May, Alexander M. Rush, Tri Dao
    The Mamba in the Llama: Distilling and Accelerating Hybrid Models
    In submission
    A shorter version at ICML 2024, 2nd Workshop on Efficient Systems for Foundation Models (ES-FoMo)
  • Junxiong Wang, Tushaar Gangavarapu, Jing Nathan Yan, Alexander M. Rush
    MambaByte: Token-free Selective State Space Model
    Models, Video
    Conference on Language Modeling (CoLM), 2024
  • Junxiong Wang, Jing Nathan Yan, Albert Gu, Alexander M. Rush
    Pretraining Without Attention
    Findings of Empirical Methods in Natural Language Processing (EMNLP), 2023
    First non-attention bidirectional model which achieves BERT-level transfer learning on the GLUE benchmark
    Models, Slides
Information Retrieval
  • Junxiong Wang, Ali Mousavi, Omar Attia, Saloni Potdar, Alexander M. Rush, Umar Farooq Minhas, Yunyao Li
    Disambiguation via Fusion Entity Decoding
    North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Learned Data System
Distributed Computing
  • Marcos K. Aguilera*, Tudor David*, Rachid Guerraoui*, Junxiong Wang* (* alphabetical order for theory paper convention)
    Locking Timestamps Versus Locking Objects
    ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC), 2018
    Associated code here

Teaching
    2022 Fall, CS 5781 - Machine Learning Engineering, Cornell Tech
    2020 Fall, CS 5320 - Database Systems Practicum, Cornell

Past