Published an AWS Technical Blog post on LLM model quantization techniques for AWS Inferentia.
Read
Released NotaMoEQuant versions of Solar-Open-100B for efficient MoE-based LLM deployment.
INT4NVFP4
Won 1st Place in Track C and the Overall Grand Prize at the NVIDIA Nemotron Hackathon Seoul.
NVIDIA recapInterview
Released the arXiv preprint Value-and-Structure Alignment for Routing-Consistent Quantization of Mixture-of-Experts Models.
arXivPDF
Two MoE quantization papers were accepted to AdaptFM @ ICML 2026.
News
About
I am a Senior AI Research Engineer at Nota Inc. My work spans efficient LLM/VLM systems,
quantization, pruning, knowledge distillation, model porting, NPU/GPU-aware optimization,
vLLM-based serving, and uncertainty-aware NLP. I received my Ph.D. in Computer Science
from KAIST, advised by Jong C. Park.
Hancheol Park, Kyo-Joong Oh, Ho-Jin Choi, Gahgene Gweon. Constructing a Paraphrase Database for Agglutinative Languages. Data & Knowledge Engineering, 2019.
Huije Lee, Hancheol Park, Wonsuk Yang, Jong C. Park. Detection of Non-Standard Meaning Usage with Word Embedding. HCIK, 2018.
Wonsuk Yang, Hancheol Park, Jong C. Park. Neural Theorem Prover with Word Embedding for Efficient Automatic Annotation. Journal of KIISE, 2017.
Hancheol Park, Jung-Ho Kim, Jong C. Park. Addressing Low-Resource Problems in Statistical Machine Translation of Manual Signals in Sign Language. Journal of KIISE, 2017.
Hancheol Park, Gahgene Gweon, Jeong Heo. Affix Modification-Based Bilingual Pivoting Method for Paraphrase Extraction in Agglutinative Languages. BigComp, 2016. AFNLP Best Asian Paper Award
Hancheol Park, Gahgene Gweon. Initiating Moderation in Problematic Smartphone Usage Patterns. CHI Extended Abstracts, 2015.
Selected Projects
Sovereign AI Foundation Model Project
Technical owner and lead developer for MoE-specific compression, INT4/NVFP4 quantization, and expert pruning for Solar-Open models.
2025 - Present
LLM Porting and Optimization for Qualcomm NPUs
Optimization and porting workflows for Llama, Qwen, and EXAONE targeting Qualcomm NPU execution environments.
2025
Hybrid LLM System for SK Telecom
Hybrid routing system between mobile SLMs and server-side LLMs based on query difficulty, showcased at MWC 2025.
2024
Efficient VLMs for On-device Industrial Safety
Lightweight VLMs under 4B parameters deployed on Snapdragon-based mobile and QRB5165 industrial platforms.
2024
Awards & Honors
NVIDIA Nemotron Hackathon Seoul - Track C 1st Place and Overall Winner, 2026.
Team Lead, NetsPresso Application, Sep. 2022 - Dec. 2025.
Team Lead, NetsPresso Performance, Sep. 2020 - Dec. 2022.
Education
Korea Advanced Institute of Science and Technology (KAIST)
Ph.D. in Computer Science Thesis: Capturing Ambiguity in Natural Language Understanding Tasks with Information from Internal Layers. Advisor: Jong C. Park