International Journal

[#166] Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor

Yeonju Kim, Se Jin Park, Yong Man Ro

IEEE Transactions on Affective Computing

[#165] Enhanced Vision-Language Models for Diverse Sensor Understanding: Cost-Efficient Optimization and Benchmarking

Sangyun Chung*, Youngjoon Yu*, Se Yeon Kim, Youngchae Chee, Yong Man Ro (*equal contribution)

IEEE Transactions on Image Processing / Code

[#164] GCAgent: Long-Video Understanding via Schematic and Narrative Episodic Memory

Jeong Hun Yeo*, Sangyun Chung*, Sungjune Park, Dae Hoe Kim, Jinyoung Moon, Yong Man Ro (*equal contribution)

IEEE Transactions on Multimedia

[#163] A Causal Lens on Non-RGB Vision Sensor Understanding in Vision Language Models

Youngjoon Yu, Yong Man Ro

IEEE Transactions on Image Processing, vol. 35, pp. 3909-3924, 2026

[#162] Adaptive Integration of Textual Context and Visual Embeddings for Underrepresented Vision Classification

Seongyeop Kim, Hyung-Il Kim, Yong Man Ro

Pattern Recognition, vol. 172, pp. 112420, 2026

[#161] Causal Unsupervised Semantic Segmentation

Junho Kim*, Byung-Kwan Lee*, Yong Man Ro (*equal contributor)

Pattern Recognition, vol. 171, pp. 112173, 2026

[#160] TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro

IEEE Transactions on Multimedia, vol. 28, pp. 1976-1988, Jan. 2026

[#159] MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection

Taeheon Kim, Sangyun Chung, Damin Yeom, Youngjoon Yu, Hak Gu Kim, Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 5, pp. 5006-5021, May 2025

[#158] Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition

Minsu Kim, Hyeong-Il Kim, Yong Man Ro

IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 2, pp. 1042-1055, Feb. 2025

[#157] Advancing Causal Intervention in Image Captioning with Causal Prompt

Youngjoon Yu, Yeonju Kim, Yong Man Ro

IEEE Transactions on Neural Networks and Learning Systems, vol. 36, no. 7, pp. 12631-12642, July 2025

[#156] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation

Minsu Kim, Jeongsoo Choi, Dahun Kim, Yong Man Ro

IEEE Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 3934-3946, 2024. / Demo

[#155] Text-Guided Distillation Learning to Diversify Video Embeddings for Text-Video Retrieval

Sangmin Lee, Hyung-Il Kim, Yong Man Ro

Pattern Recognition, vol. 156, no. 3, pp. 110754, 2024.

[#154] Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank

Sungjune Park*, Hyunjun Kim*, Yong Man Ro (* equal contributor)

Pattern Recognition, vol. 153, no. 4, pp. 110539, 2024.

[#153] Integrating Language-Derived Appearance Elements with Visual Cues in Pedestrian Detection

Sungjune Park*, Hyunjun Kim*, Yong Man Ro (* equal contributor)

IEEE Transactions on Circuits and Systems for Video Technology, pp. 1-1, 2024.

[#152] AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model

Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, and Yong Man Ro

IEEE Transactions on Multimedia, vol. 26, pp. 6462-6474, 2024.

[#151] Defending Video Recognition Model against Adversarial Perturbations via Defense Patterns

Hong Joo Lee and Yong Man Ro

IEEE Transactions on Dependable and Secure Computing, vol. 21, no. 04, pp. 4110-4121, 2024.

[#150] Enabling Visual Object Detection with Object Sounds via Visual Modality Recalling Memory

Jung Uk Kim and Yong Man Ro

IEEE Transactions on Neural Networks and Learning Systems, pp. 1-13, 2023.

[#149] Adversarial anchor-guided feature refinement for adversarial defense

Hakmin Lee and Yong Man Ro

Image and Vision Computing, vol. 136, pp. 104722, 2023.

[#148] Robust Proxy: Improving Adversarial Robustness by Robust Proxy Learning

Hong Joo Lee and Yong Man Ro

IEEE Transactions on Information Forensics & Security, vol. 18, pp. 4021-4033, 2023

[#147] Deep learning-based classification system of bacterial keratitis and fungal keratitis using anterior segment images

Yeo Kyoung Won*, Hyebin Lee*, Youngjun Kim, Gyule Han, Tae-Young Chung, Yong Man Ro and Dong Hui Lim (* equal contributor)

Frontiers in Medicine, vol. 10, pp. 1162124, 2023.

[#146] Stereoscopic Vision Recalling Memory for Monocular 3D Object Detection

Jung Uk Kim, Hyung-Il Kim, and Yong Man Ro

IEEE Transactions on Image Processing, vol. 32, pp. 2749-2760, 2023.

[#145] Advancing Adversarial Training by Injecting Booster Signal

Hong Joo Lee, Youngjoon Yu, and Yong Man Ro

IEEE Transactions on Neural Networks and Learning Systems, vol. 35, no. 9, pp. 12665-12677, Sept. 2024

[#144] Defending Person Detection Against Adversarial Patch Attack by using Universal Defensive Frame

Youngjoon Yu, Hong Joo Lee, Hakmin Lee, and Yong Man RoHakmin Lee and Yong Man Ro

IEEE Transactions on Image Processing, vol. 31, pp. 6976-6990, 2022.

[#143] Face Shape-Guided Deep Feature Alignment for Face Recognition Robust to Face Misalignment

Hyung-Il Kim, Kimin Yun, and Yong Man Ro

IEEE Transactions on Biometrics, Behavior, and Identity Science, vol. 4, no. 4, pp. 556-569, 2022.

[#142] CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition

Minsu Kim, Joanna Hong, Sejin Park, Yong Man Ro

IEEE Transactions on Multimedia, vol. 24, pp. 4342-4355, 2022.

[#141] Assessing Individual VR Sickness Through Deep Feature Fusion of VR Video and Physiological Response

Sangmin Lee, Seongyeop Kim, Hak Gu Kim, and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 136, pp. 2895-2907, 2022.

[#140] Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection

Jung Uk Kim, Sungjune Park, Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 32, no. 3, pp. 1510-1523, 2022.

[#139] On-the-Fly Facial Expression Prediction Using LSTM Encoded Appearance-Suppressed Dynamics

Wissam J. Baddar, Sangmin Lee, and Yong Man Ro

IEEE Transactions on Affective Computing, vol. 13, no. 1, pp. 159-174, 2022.

[#138] Robust Perturbation for Visual Explanation:Cross-checking Mask Optimization to Avoid Class Distortion

Junho Kim, Seongyeop Kim, Seong Tae Kim, and Yong Man Ro

IEEE Transactions on Image Processing, vol. 31, pp. 301-313, 2022.

[#137] Speech Reconstruction with Reminiscent Sound via Visual Voice Memory

Joanna Hong, Minsu Kim, Se Jin Park, Yong Man Ro

IEEE Transactions on Audio Speech and Language Processing, vol. 29, pp. 3654-3667, 2021.

[#136] CUA Loss: Class Uncertainty-Aware Gradient Modulation for Robust Object Detection

Jung Uk Kim, Seong Tae Kim, Hong Joo Lee, Sangmin Lee, and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 9, pp. 3529-3543, 2021.

[#135] Adversarially Robust Hyperspectral Image Classification via Random Spectral Sampling and Spectral Shape Encoding

Sungjune Park, Hong Joo Lee, Yong Man Ro

IEEE Access, vol. 9, pp. 66791-66804, 2021.

[#134] Robust Video Frame Interpolation with Exceptional Motion Map

Minho Park, Hak Gu Kim, Sangmin Lee, and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 2, pp. 754-764, 2021.

[#133] Multimodal Faical Biometrics Recognition: Dual-stream Convolutional Neural Networks with Multi-feature Fusion Layers

Leslie Ching Ow Tiong, Seong Tae Kim, and Yong Man Ro

Image and Vision Computing (Elsevier), vol. 102, pp. 103977, 2020.

[#132] Dual-branch structured de-striping convolution network using parametric noise model

Jongho Lee and Yong Man Ro

IEEE Access , vol. 8, pp. 155519-155528, 2020.

[#131] Deep Virtual Reality Image Quality Assessment with Human Perception Guider for Omnidirectional Image

Hak Gu Kim, Heoun-taek Lim, and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technologyk, vol. 30, no. 4, pp. 917-928, 2019.

[#130] BBC Net: Bounding-Box Critic Network for Occlusion-Robust Object Detection

Jung Uk Kim, Jungsu Kwon, Hak Gu Kim, and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 30, no. 4, pp. 1037-1050, 2019.

[#129] Encoding Features Robust to Unseen Modes of Variation with Attentive Long Short-Term Memory

Wissam J. Baddar and Yong Man Ro

Pattern Recognition, vol. 100, 107159, 2020.

[#128] Lightweight and Effective Facial Landmark Detection using Adversarial Learning with Face Geometric Map Generative Network

Hong Joo Lee, Seong Tae Kim, Hakmin Lee and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 30, no. 3, pp. 771-780, 2019.

[#127] MCSIP Net: Multi-Channel Satellite Image Prediction via Deep Neural Network

Jae-Hyeok Lee, Sangmin S. Lee, Hak Gu Kim, Sa-kwang Song, Seongchan Kim, and Yong Man Ro

IEEE Transactions on Geoscience and Remote Sensing (TGRS), vol. 58, no. 3, pp. 2212-2224, 2019.

[#126] BMAN: Bidirectional Multi-scale Aggregation Networks for Abnormal Event Detection

Sangmin Lee, Hak Gu Kim, and Yong Man Ro

IEEE Transactions on Image Processing, vol. 29, pp. 2395-2408, 2019.

[#125] Multi-Objective Based Spatio-Temporal Feature Representation Learning Robust to Expression Intensity Variations for Facial Expression Recognition

Dae Hoe Kim, Wissam J. Baddar, Jinhyeok Jang and Yong Man Ro

IEEE Transactions on Affective Computing, vol. 10, no. 2, pp. 223-236, 2017.

[#124] Endometrium Segmentation on TVUS Image Using Key-point Discriminator

Hong Joo Lee, Hyenok Park, Hak Gu Kim, Dongkuk Shin, Sa Ra Lee, Sung Hoon Kim, Mikyung Kong and Yong Man Ro

Medical Physics, vol. 46. no. 9, pp. 3974-3984, 2019.

[#123] Implementation of Multimodal Biometric Recognition via Multi-feature Deep Learning Networks and Feature Fusion

Leslie Ching Ow Tiong, Seong Tae Kim and Yong Man Ro

Multimedia Tools and Applications, vol. 78, pp. 22743-22772, 2019.

[#122] Binocular Fusion Net: Deep Learning Visual Comfort Assessment for Stereoscopic 3D

Hak Gu Kim, Hyunwook Jeong (equally contributed), Heoun-taek Lim and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 4, pp. 956-967, 2018.

[#121] VRSA Net: VR Sickness Assessment considering Exceptional Motion for 360-degree VR Video

Hak Gu Kim, Heoun-taek Lim, Sangmin Lee and Yong Man Ro

IEEE Transactions on Image Processing, vol. 28, no. 4, pp. 1646-1660, 2018.

[#120] Attended Relation Feature Representation of Facial Dynamics for Facial Authentication

Seong Tae Kim and Yong Man Ro

IEEE Transactions on Information Forensics & Security, vol. 14, no. 7, pp. 1768-1778, 2018.

[#119] Visually Interpretable Deep Network for Diagnosis of Breast Masses on Mammograms

Seong Tae Kim, Jae-Hyeok Lee, Hakmin Lee and Yong Man Ro

Physics in Medicine & Biology, vol. 63, no. 23, 235025, 2018.

[#118] Ultrafast Layer Based Computer-Generated Hologram Calculation with Sparse Template Holographic Fringe Pattern for 3-D Object

Hak Gu Kim and Yong Man Ro

Optics Express, vol. 25, no. 24, pp. 30418 - 30427, 2017.

[#117] Multi-View Stereoscopic Video Hole Filling Considering Spatio-Temporal Consistency and Binocular Symmetry for Synthesized 3D Video

Hak Gu Kim and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 7, pp. 1435-1449, 2016.

[#116] Multi-Objective based Spatio-Temporal Feature Representation Learning Robust to Expression Intensity Variations for Facial Expression Recognition

Dae Hoe Kim, Wisam J. Baddar and Yong Man Ro

IEEE Transactions on Affective Computing, vol. 10, no. 2, pp. 223-236, 2017.

[#115] Effective and Efficient Human Action Recognition Using Dynamic Frame Skipping and Trajectory Refection

Jeong-Jik Seo, Hyung-Il Kim, Wesley De Neve and Yong Man Ro

Image and Vision Computing, vol. 58, pp. 76-85, 2017.

[#114] Latent Feature Representation with Depth Directional Long-Term Recurrent Learning for Breast Masses in Digital Breast Tomosynthesis

Dae Hoe Kim, Seong Tae Kim, Jung Min Chang and Yong Man Ro

Physics in Medicine & Biology, vol. 62, no. 3, pp. 1009 - 1031, 2017.

[#113] Experimental Investigation of Facial Expressions Associated with Visual Discomfort: Feasibilty Study Towards an Objective Measurement of Visual Discomfort Based on Facial Expression

Seong-Il Lee, Seung Ho Lee, Konstantinos N. Plataniotis and Yong Man Ro

IEEE/OSA Journal of Display Technology, vol. 12, no. 12, pp. 1785-1797, 2016.

[#112] Acceleration of Calculation Speed of Computer-Generated Holograms Using the Sparsity of the Holographic Fringe Pattern for 3D Object

Hak Gu Kim, Hyunwook Jeong and Yong Man Ro

Optics Express, vol. 24, no. 22, pp. 25317 - 25328, 2016.

[#111] Experimental Investigation of the Effect of Binocular Disparity on the Visibility Threshold of Asymmetric Noise in Stereoscopic Viewing

Hak Gu Kim, Seong-Il Lee (equally contributed) and Yong Man Ro

Optics Express, vol. 24, no. 17, pp. 19607 - 19615, 2016.

[#110] Critical Binocular Asymmetry Measure for Perceptual Quality Assessemnt of Synthesized Stero 3D Images in View Synthesis

Yong Ju Jung, Hak Gu Kim and Yong Man Ro

IEEE Transactions on Circuits and Systems for Video Technology, vol. 26, no. 7, pp. 1201 - 1214, 2015.

[#109] Collaborative Expression Representation Using Peak Expression and Intra Variation Face Images for Practical Subject-Independent Emotion Recognition in Videos

Seung Ho Lee, Wissam J. Baddar and Yong Man Ro

Pattern Recognition, vol. 54, pp. 52 - 67, 2016.

[#108] Feature Scalability for a Low Complexity Face Recognition with Unconstrained Spatial Resolution

Hyung-Il Kim, Seung Ho Lee, Jae-Young Choi and Yong Man Ro

Multimedia Tools and Applications, vol. 75, no. 12, pp. 6887 - 6908, 2016.

[#107] Classifier Ensemble Generation and Selection with Multiple Feature Representations for Classification Applications in Computer-Aided Detection and Diagnosis on Mammography

Jae Young Choi, Dae Hoe Kim, Konstantinos N. Plataniotis and Yong Man RO

Expert Systems with Applications, vol. 46, pp. 106 - 121, 2016.

[#106] Partial Matching of Facial Expression Sequence Using Over-complete Transition Dictionary for Emotion Recognition

Seung Ho Lee and Yong Man Ro

IEEE Transactions on Affective Computing, vol. 7, no. 4, pp. 389 - 408, 2015.

[#105] Detection of Masses in Digital Breast Tomosynthesis Using Complementary Information of Simulated Projection

Seong Tae Kim, Dae Hoe Kim and Yong Man Ro

Medical Physics, vol. 42, no. 12, pp. 7043 - 7058, 2015.

[#104] Improving Mass Detection Using Combined Feature Representations from Projection Views and Reconstructed Volume of DBT and Boosting Based Classification with Feature Selection

Dae Hoe Kim, Seong Tae Kim and Yong Man Ro

Physics in Medicine and Biology, vol. 60, no. 22, pp. 8809 - 8832, 2015.

[#103] Image-Based Coin Recognition Using Rotation-Invariant Region Binary Patterms Based on Gradient Magnitudes

Semin Kim, Seung Ho Lee and Yong Man Ro

Journal of Visual Communication and Image Representation, vol. 32, pp. 217 - 223, 2015.

[#102] Towards a Physiology-based Measure of Visual Discomfort: Brain Activity Measurement While Viewing Stereoscopic Images with Different Screen Disparities

Yong Ju Jung, Dongchan Kim, Hosik Sohn, Seong-il Lee, Hyun Wook Park, and Yong Man Ro

IEEE/OSA Journal of Display Technology, vol. 11, no. 9, pp. 730-743, 2015.

[#101] Region Based Stellate Features Combined with Variable Selection Using AdaBoost Learning in Mammographic Computer-aided Detection

Dae Hoe Kim, Jae Young Choi, and Yong Man Ro

Computers in Biology and Medicine, vol. 63, pp. 238-250, 2015.

[#100] Breast mass detection using slice conspicuity in 3D reconstructed digital breast volumes

Seong Tae Kim, Dae Hoe Kim, and Yong Man Ro

Physics in Medicine and Biology, vol. 59, no. 17, pp. 5003-5023, 2014.

1 2 3

Page updated

Report abuse