<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Research | AIML@K AI + Math Lab at Korea University</title><link>https://aiml-k.github.io/research/</link><atom:link href="https://aiml-k.github.io/research/index.xml" rel="self" type="application/rss+xml"/><description>Research</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Wed, 10 Jun 2026 00:00:00 +0000</lastBuildDate><image><url>https://aiml-k.github.io/media/logo_hueb9af1ea948ba9e324912a22ecd54985_75745_300x300_fit_lanczos_3.png</url><title>Research</title><link>https://aiml-k.github.io/research/</link></image><item><title>Mathematical Foundations of AI</title><link>https://aiml-k.github.io/research/math-for-ai/</link><pubDate>Sat, 20 Jun 2026 00:00:00 +0000</pubDate><guid>https://aiml-k.github.io/research/math-for-ai/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Modern AI systems work remarkably well, yet much of &lt;em>why&lt;/em> they work from algorithmic and scientific perspective remains poorly understood.
This theme builds mathematical foundations for learning systems, treating optimization landscapes, training dynamics, and generalization as objects we can analyze rather than merely observe.&lt;/p>
&lt;p>Our results span several fronts.
From optimization side, we have studied how training can bypass stationary points, and also dabbled on fractional-order gradient methods.
On deep learning model structure, we develop principled structural pruning — using bifurcation dynamics and projective geometry to decide what edges a network can lose without giving up its functional output.
From the representation learning viewpoint, we analyzed feature learning as a form of covariance learning and study the emergent linear separability of features in a network&amp;rsquo;s last layer.
The throughline is turning empirical phenomena into theory that predicts behavior of deep learning models, guides better model design, and deeper understanding of why modern AI works.&lt;/p>
&lt;h2 id="core-questions">Core Questions&lt;/h2>
&lt;ul>
&lt;li>What governs optimization dynamics, and when can training escape or bypass bad critical points?&lt;/li>
&lt;li>How much of a network is truly necessary, and how do we prune with provable structure rather than heuristics?&lt;/li>
&lt;li>How do useful features and clean geometric structure emerge in representations during training?&lt;/li>
&lt;/ul>
&lt;h2 id="representative-work">Representative Work&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2024tnnls-jung-lee/" target="_blank" rel="noopener">Bypassing Stationary Points in Training Deep Learning Models&lt;/a> — &lt;em>IEEE TNNLS, 2024&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025spl-cha-lee/" target="_blank" rel="noopener">Curse of Smoothness in Functional Neural Networks&lt;/a> — &lt;em>IEEE Signal Processing Letters, 2025&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025icml-hildworkshop-catalyst/" target="_blank" rel="noopener">Catalyst: Structured Pruning with Robust Bifurcation Dynamics&lt;/a> — &lt;em>ICML 2025 HiLD Workshop&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025neurips-optworkshop/" target="_blank" rel="noopener">Feature Learning as a Virtual Covariance Learning&lt;/a> — &lt;em>NeurIPS 2025 OPT Workshop&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025icml-hildworkshop-linsep/" target="_blank" rel="noopener">Emergent Linear Separability of Unseen Data Points in High-dimensional Last-Layer Feature Space&lt;/a> — &lt;em>ICML 2025 HiLD Workshop&lt;/em>&lt;/li>
&lt;/ul>
&lt;p>See all work on the &lt;a href="https://aiml-k.github.io/publication/">Publications&lt;/a> page.&lt;/p>
&lt;h2 id="related">Related&lt;/h2>
&lt;ul>
&lt;li>Logs tagged &lt;a href="https://aiml-k.github.io/tag/math4ai/">Math4AI&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/event/">Events&lt;/a> and reading groups on learning theory&lt;/li>
&lt;/ul>
&lt;h2 id="people">People&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/author/donghun-lee/">Donghun Lee&lt;/a> — Principal Investigator&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/taehun-cha/">Taehun Cha&lt;/a> — feature learning, last-layer geometry&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/jaeheun-jung/">Jaeheun Jung&lt;/a> — structured pruning, training dynamics&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/bosung-jung/">Bosung Jung&lt;/a> — optimization, unlearning&lt;/li>
&lt;/ul>
&lt;p>See &lt;a href="https://aiml-k.github.io/people/">People&lt;/a> for the full lab.&lt;/p></description></item><item><title>Applied AI Systems</title><link>https://aiml-k.github.io/research/applied-ai/</link><pubDate>Fri, 19 Jun 2026 00:00:00 +0000</pubDate><guid>https://aiml-k.github.io/research/applied-ai/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>This theme is where our research meet real problem with practical constraints.
We build and study applied systems where data is messy, latency matters, and reliability is as important as accuracy — and we feed what we learn in deployment back into the lab&amp;rsquo;s core research.&lt;/p>
&lt;p>The goal is twofold: deliver AI-powered systems that are genuinely useful, and stress-test research ideas against the friction of the real world.&lt;/p>
&lt;h2 id="core-questions">Core Questions&lt;/h2>
&lt;ul>
&lt;li>How do research methods hold up under real-world data, scale, and constraints?&lt;/li>
&lt;li>What makes an ML system dependable, maintainable, and genuinely useful?&lt;/li>
&lt;li>How can AI support teaching, learning, and sequential decision-making under uncertainty?&lt;/li>
&lt;/ul>
&lt;h2 id="representative-work">Representative Work&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025kcc-shin-lee/" target="_blank" rel="noopener">Agent-based Instructional Support Chatbot&lt;/a> — &lt;em>KCC 2025&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025pakdd-lee-lee-lee/" target="_blank" rel="noopener">Investigating the Limits of Graph Foundation Model in Real-World Travel Recommendation Systems&lt;/a> — &lt;em>PAKDD 2025 GLFM Workshop&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/10-1007-978-3-031-05936-0-26/" target="_blank" rel="noopener">Online Learning with Regularized Knowledge Gradients&lt;/a> — &lt;em>PAKDD 2022&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/lee-powell-2019-tac/" target="_blank" rel="noopener">Bias-Corrected Q-Learning With Multistate Extension&lt;/a> — &lt;em>IEEE TAC, 2019&lt;/em>&lt;/li>
&lt;/ul>
&lt;p>See all work on the &lt;a href="https://aiml-k.github.io/publication/">Publications&lt;/a> page.&lt;/p>
&lt;h2 id="related">Related&lt;/h2>
&lt;ul>
&lt;li>Logs tagged &lt;a href="https://aiml-k.github.io/tag/application/">Application&lt;/a>&lt;/li>
&lt;li>Related &lt;a href="https://aiml-k.github.io/log/">logs&lt;/a> and &lt;a href="https://aiml-k.github.io/event/">events&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="people">People&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/author/donghun-lee/">Donghun Lee&lt;/a> — Principal Investigator&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/dayeon-shin/">Dayeon Shin&lt;/a> — instructional chatbots&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/kyunghee-roh/">Kyunghee Roh&lt;/a> — ML systems and workloads&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/yeajin-lee/">Yeajin Lee&lt;/a> — applied recognition models&lt;/li>
&lt;/ul>
&lt;p>See &lt;a href="https://aiml-k.github.io/people/">People&lt;/a> for the full lab. Earlier work on recommendation and
education systems was led by alumni including &lt;a href="https://aiml-k.github.io/author/nayoung-lee/">Nayoung Lee&lt;/a> and &lt;a href="https://aiml-k.github.io/author/yanggee-kim/">Yanggee Kim&lt;/a>.&lt;/p></description></item><item><title>AI for Science and Engineering</title><link>https://aiml-k.github.io/research/ai-for-science/</link><pubDate>Thu, 18 Jun 2026 00:00:00 +0000</pubDate><guid>https://aiml-k.github.io/research/ai-for-science/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;!-- Many problems in science and engineering are governed by equations that are expensive to solve and hard to invert.
This theme develops machine-learning methods — especially operator learning and physics-aware generative models — that approximate these systems efficiently and generalize across conditions. -->
&lt;p>We believe that mathematics-guided AI application can cover a wide range of science and engineering fields.&lt;/p>
&lt;p>We built neural operators that use both Laplace and Fourier representations for generalizable, efficient operator learning algorithm suitable for many PDE problems.
In seismology we showed that AI can create high quality broadband ground motion and ambient seismic noise with diffusion and generative models.&lt;br>
In material science and engineering we created AI-driven pipelines for estimating and optimizing mechanical properties of epoxy polymers. Also, transfer-learning across data-rich and data-poor battery imaging , and synthesize bar-link mechanism designs directly from specification.
In biomedical field we developed multimodal models for knee-osteoarthritis diagnosis and automated measurement of
spinal parameters.
Across all of these, the common thread is to abstractify the governing nature of the underlying problem.&lt;/p>
&lt;h2 id="core-questions">Core Questions&lt;/h2>
&lt;ul>
&lt;li>How can neural operators learn solution maps for whole families of PDEs (often used in STEM problems), efficiently and with guarantees of generalization?&lt;/li>
&lt;li>How do we build generative models that respect physical structure and stay reliable even when facing out of distribution cases?&lt;/li>
&lt;li>How can effective surrogates be designed to replace or accelerate classical simulation in science and engineering?&lt;/li>
&lt;/ul>
&lt;h2 id="representative-work">Representative Work&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025neurips-ml4psworkshop/" target="_blank" rel="noopener">Best of Both Worlds: Bridging Laplace and Fourier for Generalizable and Efficient Operator Learning&lt;/a> — &lt;em>NeurIPS 2025 ML4PS Workshop&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025icml-jung-lee-jung-kim-jung-lee/" target="_blank" rel="noopener">Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition&lt;/a> — &lt;em>ICML 2025&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025-clip-koa/" target="_blank" rel="noopener">CLIP-KOA: Enhancing Knee Osteoarthritis Diagnosis with Multi-Modal Learning&lt;/a> — &lt;em>MICCAI 2025 LMID Workshop&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025arxiv-smart-reexperimentation/" target="_blank" rel="noopener">Re-experiment Smart: Enhancing Data-driven Prediction of Mechanical Properties of Epoxy Polymers&lt;/a> — &lt;em>preprint, 2025&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2024ieeegrsl-cho-ha-lim-han-kim-lee/" target="_blank" rel="noopener">PPSD GAN: PPSD-informed Generative Model for Ambient Seismic Noise Synthesizing&lt;/a> — &lt;em>IEEE GRSL, 2024&lt;/em>&lt;/li>
&lt;/ul>
&lt;p>See all work on the &lt;a href="https://aiml-k.github.io/publication/">Publications&lt;/a> page.&lt;/p>
&lt;h2 id="related">Related&lt;/h2>
&lt;ul>
&lt;li>Logs tagged &lt;a href="https://aiml-k.github.io/tag/ai4science/">AI4Science&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/event/">Events&lt;/a> and collaborations with science and engineering partners&lt;/li>
&lt;/ul>
&lt;h2 id="people">People&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/author/donghun-lee/">Donghun Lee&lt;/a> — Principal Investigator&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/jeongun-ha/">Jeongun Ha&lt;/a> — operator learning&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/jaehyuk-lee/">Jaehyuk Lee&lt;/a> — seismic and generative modeling&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/jaeheun-jung/">Jaeheun Jung&lt;/a> — ground-motion synthesis, mechanism design&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/hanyoung-kim/">Hanyoung Kim&lt;/a> — generative seismic modeling&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/minseok-choi/">Minseok Choi&lt;/a> — Visiting Scholar (POSTECH), scientific machine learning&lt;/li>
&lt;/ul>
&lt;p>See &lt;a href="https://aiml-k.github.io/people/">People&lt;/a> for the full lab.&lt;/p></description></item><item><title>Language, Reasoning, and Knowledge</title><link>https://aiml-k.github.io/research/language-reasoning/</link><pubDate>Wed, 17 Jun 2026 00:00:00 +0000</pubDate><guid>https://aiml-k.github.io/research/language-reasoning/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Language models (LMs) have become general-purpose reasoning engines, yet they reason unevenly and sometimes state confident falsehoods.
This theme studies the internal representations of LMs and how they affect the behavior of LM-based system, e.g. knowledge retrieval, reasoning, and self-assessment, to name a few.&lt;/p>
&lt;p>Hallucination is a famous example: we have shown that pre-trained language models return distinguishable probability distributions on unfaithfully hallucinated text, and we have mapped the types of hallucination that arise in question answering along with the limits of the metrics used to detect them.
We also probe how far models can extrapolate beyond their training distribution — for instance, into chemical domains — and we develop better representations and retrieval, from sentence-level topic models to aspect-based dense passage retrieval.
Bringing in reinforcement learning to control the behavior of LMs (akin to how harnessing agents work nowadays) to high-stakes real-world text, such as extracting the consequences of central bank communications.&lt;/p>
&lt;h2 id="core-questions">Core Questions&lt;/h2>
&lt;ul>
&lt;li>What causes hallucination, and how can it be detected, measured, and reduced towards trustable and explainable language models?&lt;/li>
&lt;li>How well can/do language models extrapolate beyond their training distribution?&lt;/li>
&lt;li>How should internal knowledge in LMs be represented and retrieved so that reasoning over text becomes more reliable?&lt;/li>
&lt;/ul>
&lt;h2 id="representative-work">Representative Work&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2024emnlp-cha-lee/" target="_blank" rel="noopener">Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts&lt;/a> — &lt;em>EMNLP 2024 Findings&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/cha-lee-2024-sentencelda/" target="_blank" rel="noopener">SentenceLDA: Discriminative and Robust Document Representation with Sentence Level Topic Model&lt;/a> — &lt;em>EACL 2024&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2024acl-cha-lee/" target="_blank" rel="noopener">Evaluating Extrapolation Ability of Large Language Model in Chemical Domain&lt;/a> — &lt;em>ACL 2024 Workshop&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2026ce-cha-lee/" target="_blank" rel="noopener">Consequence-Guided Information Extraction for Predicting Central Bank Communication&amp;rsquo;s Effect&lt;/a> — &lt;em>Computational Economics, 2026&lt;/em>&lt;/li>
&lt;/ul>
&lt;p>See all work on the &lt;a href="https://aiml-k.github.io/publication/">Publications&lt;/a> page.&lt;/p>
&lt;h2 id="related">Related&lt;/h2>
&lt;ul>
&lt;li>Logs tagged &lt;a href="https://aiml-k.github.io/tag/languagemodel/">LanguageModel&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/event/">Events&lt;/a> and discussions on LLMs and reasoning&lt;/li>
&lt;/ul>
&lt;h2 id="people">People&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/author/donghun-lee/">Donghun Lee&lt;/a> — Principal Investigator&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/taehun-cha/">Taehun Cha&lt;/a> — hallucination, extrapolation, representation&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/suhyun-bae/">Suhyun Bae&lt;/a> — hallucination in question answering&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/hanyoung-kim/">Hanyoung Kim&lt;/a> — dense passage retrieval&lt;/li>
&lt;/ul>
&lt;p>See &lt;a href="https://aiml-k.github.io/people/">People&lt;/a> for the full lab.&lt;/p></description></item><item><title>AI for Mathematics</title><link>https://aiml-k.github.io/research/ai-for-math/</link><pubDate>Mon, 15 Jun 2026 00:00:00 +0000</pubDate><guid>https://aiml-k.github.io/research/ai-for-math/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Mathematics is the most precise of the sciences, built on logical deduction and proof.
The tools by which mathematicians explore, conjecture, and verify are now changing: powerful computation and modern AI are turning parts of mathematical research into a collaboration between human and machine.
As an AI-research lab housed in a department of mathematics, this is our home ground.&lt;/p>
&lt;p>A recurring keyword is &lt;em>representation&lt;/em>, i.e. how mathematical objects should be encoded so that a modern AI model can show reasoning behavior, explainable AI aspects, faithfulness in semantics, and on.
We have shown that numbers can be richly represented to deep neural networks using its p-adic expansions, and that a principled Fourier basis tied to prime structure gives a clean theoretical foundation to understand why language models struggle to learn modular arithmetics.
We also studied how AI handles mathematical language before the advent of ChatGPT, using math word problems with non-numeric answers and customized tool-calling operations that modern AI services rely on to answer computation-related questions.&lt;/p>
&lt;h2 id="core-questions">Core Questions&lt;/h2>
&lt;ul>
&lt;li>How should numbers, structures, and proofs be represented so that models can reason about them faithfully rather than by surface pattern?&lt;/li>
&lt;li>Which mathematical tasks can AI genuinely assist, and where does it break down?&lt;/li>
&lt;li>How can computation, conjecture, and proof reinforce one another?&lt;/li>
&lt;/ul>
&lt;h2 id="representative-work">Representative Work&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2026icml-hwang-bae-lee/" target="_blank" rel="noopener">Prime Fourier Embeddings: A Principled Basis for Modular Arithmetic&lt;/a> — &lt;em>ICML 2026 AI for Math Workshop&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/2025neurips-mathaiworkshop/" target="_blank" rel="noopener">Numbers Already Carry Their Own Embeddings&lt;/a> — &lt;em>NeurIPS 2025 MathAI Workshop&lt;/em>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/publication/cha-etal-2022-noun/" target="_blank" rel="noopener">Noun-MWP: Math Word Problems Meet Noun Answers&lt;/a> — &lt;em>COLING 2022&lt;/em>&lt;/li>
&lt;/ul>
&lt;p>See all work on the &lt;a href="https://aiml-k.github.io/publication/">Publications&lt;/a> page.&lt;/p>
&lt;h2 id="related">Related&lt;/h2>
&lt;ul>
&lt;li>Logs tagged &lt;a href="https://aiml-k.github.io/tag/ai4math/">AI4Math&lt;/a> · &lt;a href="https://aiml-k.github.io/tag/conjecture/">conjecture&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/event/">Events&lt;/a> and talks on AI for mathematics&lt;/li>
&lt;/ul>
&lt;h2 id="people">People&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://aiml-k.github.io/author/donghun-lee/">Donghun Lee&lt;/a> — Principal Investigator&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/suhyun-bae/">Suhyun Bae&lt;/a> — number embeddings, modular arithmetic&lt;/li>
&lt;li>&lt;a href="https://aiml-k.github.io/author/taehun-cha/">Taehun Cha&lt;/a> — mathematical language and reasoning&lt;/li>
&lt;/ul>
&lt;p>See &lt;a href="https://aiml-k.github.io/people/">People&lt;/a> for the full lab.&lt;/p></description></item></channel></rss>