Summary
Education
Skills
Software
Research & Academic Background
Project
Timeline
Generic
Huang Chi-Wei

Huang Chi-Wei

New Taipei

Summary

M.S. student in NCKU, passionate about AI computing optimization and computer architecture, specializing in model compression, quantization, and efficient LLM inference. Experienced in optimizing LLMs for resource-constrained hardware and developing advanced computing systems.

Education

Master of Science - Smart And Sustainable Manufacturing

National Cheng Kung University
Tainan City, Taiwan
06-2026

Bachelor of Science - Chemistry

National Cheng Kung University
Tainan City, Taiwan
06-2023

High School Diploma -

Taipei Municipal Chien Kuo High School
Taipei City, Taiwan
06-2018

Skills

  • Computer Architecture
  • Computer Organization
  • Algorithms
  • Data Structure
  • Natural Language Processing
  • Efficient Ai Model Design for ML and Inference
  • Ai Computing and Applications
  • Programming Design

Software

C/C

Python

RISC-V assembly

Chisel

English (CEFR) C1

Research & Academic Background

Research Focus: Model Compression, Quantization, Efficient LLM Inference
Advisor: Prof. Chai-Chi Tsai  
Lab: AI Systems Lab

Project

Google Research Project – AI-Based Speech Screening Tool

  • Integrated an LLM-based ASR model (Whisper) into a Flutter app, optimizing Whisper quantization for real-time speech analysis on Android (Pixel 6).


Capstone Project - Enhancing User Privacy Through Local Deployment of LLMs

  • Conducted model workload analysis and applied state-of-the-art quantization techniques on Llama models, using MLX (Apple ML) for deployment on Apple Silicon (M-series) to improve efficiency and privacy.
  • Developed a Next.js UI and built a FastAPI backend with Ollama, optimizing MLX server for real-time response streaming.


Course Project -  Performance Modeling and Optimization on μRISC-V Processor

  • Implemented RISC-V assembly for array multiplication, analyzing execution time and CPI to classify CPU vs. Memory-bound tasks.
  • Optimized array multiplication using RISC-V V Extension, achieving speedup >6x over baseline execution.


Capstone Project - Matrix Multiplication Accelerator

  • Designed a systolic array accelerator in Chisel with weight-stationary and output-stationary dataflows, optimizing GEMM operations for low-latency AI inference.

Timeline

Master of Science - Smart And Sustainable Manufacturing

National Cheng Kung University

Bachelor of Science - Chemistry

National Cheng Kung University

High School Diploma -

Taipei Municipal Chien Kuo High School
Huang Chi-Wei