Welcome to Fangxin Liu’s Homepage~

Fangxin (Leon) Liu is an Assistant Professor and Ph.D. Supervisor in the School of Computer Science at Shanghai Jiao Tong University (SJTU). He is a core member of the Scalable Computing and Systems Lab, collaborating closely with Prof. Haibing Guan and Prof. Li Jiang. He also serves as a Research Fellow at the Shanghai Qi Zhi Institute.

His research focuses on computer architecture and hardware-software co-design for efficient AI systems, particularly for LLM/VLM, Computing-in-Memory (CIM/PIM) architectures, and Brain-inspired Computing.

Dr. Liu has published over 60 papers, including 40+ in CCF Tier-A venues (e.g., ISCA, MICRO, ASPLOS, HPCA, PPoPP). His work has been recognized with the Best Paper Finalist at ISCA 2026, the Outstanding Paper Award at ACM MM 2025 (Systems Theme), the Best Paper Award at DATE 2022, and the HUAWEI Spark Award (火花奖).

40+ CCF Tier A 1st / Corr. Author
60+ Total Pubs 1st / Corr. Author
40% Cost Saved Applied at Huawei, Ant, etc.
6+ Major Awards Best Paper / Dissert.

His architectural and system solutions have been deployed by leading technology companies, including Huawei, Ant Group, ZTE, and Yizhu Tech., resulting in up to 40% computational cost reductions in large-scale AI deployments.


🔥 Recruitment

Our team is actively seeking self-motivated PhD, Master, and Undergraduate students interested in Computer Architecture, Efficient AI acceleration, and PIM Design. If you are interested, please email me your CV.


News

Apr. 27, 2026
🏆 Best Paper Candidate: Our paper "COMET: A Cooperative Scheduling Framework for Concurrent PIM/CPU Execution on Mobile Devices" has been selected as one of the five finalists for the Best Paper Award at ISCA 2026. Congratulations to Yilong and all co-authors on this prestigious honor!
Apr. 14, 2026
🛠️ Our high-performance Attention Sparse Acceleration Kernels have been officially integrated into the Huawei CANN (Compute Architecture for Neural Networks) software stack. Furthermore, our team has successfully passed the CANN Core Developer Certification, marking a significant step in bridging architectural research with large-scale industrial infrastructure.
Apr. 06, 2026
🚀 ACL 2026: Our paper (CSD) on Speculative Decoding Acceleration has been accepted to the ACL 2026 Main Conference. Congratulations to Xuwen and all co-authors!
Mar. 28, 2026
🚀 ISCA 2026: Three papers covering Sparse Matrix Multiplication (Harmonia), MoE Inference Optimization (STEP), and Mobile PIM/CPU Scheduling (COMET) have been accepted to the 53rd International Symposium on Computer Architecture. Congratulations to Jingkui, Ning, Yilong, and all co-authors!
Feb. 24, 2026
🚀 Five papers covering Neuromorphic Computing, 3DGS, MoE and PCIe Simulation have been accepted to DAC 2026. Congratulations to Haomin, Chenyang, Zhibai and all co-authors!
Feb. 04, 2026
📄 Our joint technical report with Huawei MindSpore team, HyperOffload, is released. It cuts peak memory by 26% with end-to-end performance lossless. arXiv: 2602.00748
Jan. 24, 2026
📄 Our paper "NICE: Deep Neural Network Acceleration via Hardware-Friendly Index Assisted Compression" has been accepted to ACM TACO 2026.
Jan. 21, 2026
🏆 Our work “TFLOP” has received the Special Feature Award at the ASP-DAC University LSI Design Contest 2026.
Nov. 26, 2025
📄 Our two papers on MoE memory bottleneck and 3DGS rendering have been accepted to ASPLOS 2026.
Nov. 11, 2025
📄 Two papers on graph-based memory and sparse Transformer acceleration accepted to PPoPP 2026.
Nov. 11, 2025
📄 Three papers on Modular Multiplication, LLM, and CPU-GPU computing accepted to DATE 2026.
Nov. 10, 2025
🏆 ASTER awarded Outstanding Paper in Systems Theme at ACM MM 2025.
🕒 Click to view all Archived News (2025 - 2022)
Nov. 08, 2025
📄 Two papers on 3DGS Acceleration accepted to HPCA 2026.
Nov. 08, 2025
📄 Paper "SpecQuant" accepted to AAAI 2026.
Oct. 11, 2025
🏆 Won First & Third Prize in the 2nd Chiplet Technology Open Source Competition.
Sep. 20, 2025
🏆 Won Grand Prize and Best Project Poster Award in CCF Sys2025 Graph Computing Competition.
Sep. 05, 2025
📄 Paper "BLADE" on DRAM-based LLM Acceleration accepted to ASP-DAC 2026.
Aug. 29, 2025
📰 "FlexQuant" framework reported by Ant Group Asystem Team.
Aug. 21, 2025
📄 Paper on Flexible Quantization for LLM accepted to EMNLP 2025.
Jul. 06, 2025
📄 Adaptive Dynamic Layer-skipping Framework for LLM accepted to ACM MM (Oral) 2025.
Jul. 01, 2025
📄 Three papers on PIM-LLM, circuit optimization, and PCIe tracing accepted to ICCAD 2025.
May. 03, 2025
📄 Paper on "Collision Detection Accelerator Based on RRAM-TCAMs" accepted to IEEE TCAD 2025.
Apr. 29, 2025
📄 Two papers on "PIM+NeRF" and "PIM+Database" accepted by ASPLOS 2026.
Aug. 26, 2024
💰 Received grant from NSFC Youth Fund for Adaptive Compression Encoding.
Jul. 06, 2024
🏆 Received 2023 ACM Shanghai Doctoral Dissertation Award.
Mar. 18, 2024
🏆 Received 2023 Shanghai CCF Outstanding Dissertation Award.
Dec. 29, 2023
📰 "SPARK" framework reported in Jiqizhixin (机器之心).
Nov. 18, 2022
📄 Paper "SIMSnn" accepted by DATE 2023.

🔬 Research Interests

His research focuses on Hardware-Software Co-design for efficient AI systems:

🚀 LLMs & Neural Network Acceleration
  • Algorithm-System Co-optimization & AI Deployment [ACL'26, ASPLOS'25, EMNLP'25, ACM MM’25 (Outstanding Paper), ISCA’25, ASP-DAC’25]
  • Execution & Micro-architecture Optimization [ISCA'26, LSI’25 Feature Awards, TACO'26, HPCA'25, HPCA'24, 2×DATE’25, TPDS’24, ASP-DAC’24, DAC’26]
  • Sparsity Compilation & Efficient Encoding Acceleration (e.g., Torus-based Saddlepoint Approximation) [PPoPP'26, HPCA’25, HPCA’24, DAC’24, ICCAD’25, TODAES’24, ASP-DAC’24]
💾 Computing-in-Memory (CiM/PIM) Architecture
  • Hardware-Algorithm Co-design & PIM Scheduling [ISCA’26 (Best Paper Finalist), ICCAD'25, MICRO’24, ASP-DAC’24, APPT’25, DATE’22 (Best Paper)]
👁️ Spatial Intelligence (Efficient 3D Perception & Rendering)
  • 3D Scene Reconstruction & Acceleration for High-Fidelity Real-Time Interaction [HPCA’26, ASPLOS’25, DAC’26] Video
  • Deformable Attention Optimization for Efficient 3D Detection [DAC’24]
🧠 Brain-inspired Neuromorphic Computing
  • Neuromorphic Algorithms & Brain-inspired Applications [ISCA'25, MICRO'24, DAC'24, DAC'23, AAAI'23, ICCAD’23, SIGIR’22]
🛡️ Hardware-assisted Secure & Trustworthy AI
  • Area-Efficient Cryptographic Design for LUT-based Modular Reduction [DATE’26, DAC’25]
  • Secure Neuromorphic Computing Architecture [TACO’25, DAC’24, ASP-DAC’24, DAC’23]

Recent Visits to this Site