Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
Blog Post number 1
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
portfolio
Portfolio item number 1
Published:
Short description of portfolio item number 1
Portfolio item number 2
Published:
Short description of portfolio item number 2 
publications
23.3 EdgeDiff: 418.4 mJ/Inference Multi-Modal Few-Step Diffusion Model Accelerator with Mixed-Precision and Reordered Group Quantization
Published in IEEE International Solid-State Circuits Conference (ISSCC), 2025, 2025
LightRot: A Light-weighted Rotation Scheme and Architecture for Accurate Low-bit Large Language Model Inference
Published in IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2025
EdgeDiff: Multi-modal Few-step Diffusion Model Accelerator with Mixed-Precision and Reordered Group-Quantization for On-device Generative AI Motivation
Published in IEEE Hot Chips 37 Symposium (HCS), 2025, 2025
An Energy-Efficient High Resolution Vision Transformer Processor Exploiting Token Similarity Beyond Token Merging
Published in IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2025, 2025
EdgeDiff: Energy-Efficient Multi-Modal Few-Step Diffusion Model Accelerator Using Mixed-Precision and Reordered Group Quantization
Published in IEEE Journal of Solid-State Circuits, 2025, 2025
GyRot: Leveraging Hidden Synergy between Rotation and Fine-grained Group Quantization for Low-bit LLM Inference
Published in IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2026, 2026
31.2 Revolver: Low-Bit GenAI Accelerator for Distilled-Model and CoT with Phase-Aware-Quantization and Rotation-Based Integer-Scaled Group Quant
Published in IEEE International Solid-State Circuits Conference (ISSCC), 2026, 2026
A 198.7 μJ/token Block Diffusion LLM Processor with Mask Token Similarity-Based Acitivation Reuse
Published in IEEE International Symposium on Circuits and Systems (ISCAS), 2026, 2026
SeVeDo: A Heterogeneous Transformer Accelerator for Low-Bit Inference via Hierarchical Group Quantization and SVD-Guided Mixed Precision
Published in IEEE International Symposium on Circuits and Systems (ISCAS), 2026, 2026
SliceMoE: Bit-Sliced Expert Caching under Miss-Rate Constraints for Efficient MoE Inference
Published in 63rd ACM/IEEE Design Automation Conference (DAC), 2026, 2026
ELMoE-3D: Leveraging Intrinsic Elasticity of MoE for Hybrid-Bonding-Enabled Self-Speculative Decoding in On-Premises Serving
Published in arXiv, 2026, 2026
talks
Talk 1 on Relevant Topic in Your Field
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Conference Proceeding talk 3 on Relevant Topic in Your Field
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
teaching
Teaching experience 1
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
