Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
Kang, H.*, Lee, J.*, Shin, W.*, Om, K., & Park, J. (2026). "Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function." ICLR.
Lee, J., Kim, M., Choi, S., Song, I., Yun, S., Kang, H., Shin, W., Yun, T., Om, K., & Park, J. (2026). "Diffusion Alignment as Variational Expectation-Maximization." ICLR.
Om, K.*, Sim, K.*, Yun, T.*, Kang, H., & Park, J. (2025). "Posterior Inference in Latent Space for Scalable Constrained Black-box Optimization." NeurIPS Workshop on Structured Probabilistic Inference & Generative Modeling (Oral).
Yun, T.*, Om, K.*, Lee, J., Yun, S., & Park, J. (2025). "Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization." ICML; FPI @ ICLR Workshop.