Computer Vision researcher and AI developer with deep expertise in Generative AI, image synthesis, and automated image understanding. Specialized in designing, training, and deploying deep learning architectures for tasks such as image generation, segmentation, object detection, and image-to-image translation. Proficient in advanced frameworks including PyTorch and TensorFlow, I have hands-on experience with Neural Networks, GANs, diffusion models, and vision transformers. My research integrates technical innovation with real-world applications.
Soongsil University (2022–2024)
GPA: 4.34/4.50
Thesis: Korean Font Generation using Position-based Components (YOLOv8, GANs)
MUET SZAB Campus Khairpur Mir's (2016–2021)
GPA: 3.86/4.00
Thesis: FIS Hostel (Food Internet Security)
System Software Lab, Soongsil University — Seoul, South Korea
Sep 2022 – PresentCubix — Karachi, Pakistan
Feb 2022 – Aug 2022Developed a system to generate 2,780 Korean characters using only 43 handwritten samples. Utilized YOLOv8 for efficient character detection and segmentation, and PACGAN for high-quality font style synthesis.
Tech: YOLOv8, GANs (PACGAN), PyTorch, Korean Font Generation
View Project (mywriting.kr)Developed a GAN-based model for enhancing handwriting quality by blending styles. Published in MDPI Electronics. Implemented novel fusion techniques for style transfer.
Tech: GANs, Style Transfer, Computer Vision, Research
View Code
I contributed to and executed this open-source project to deepen my understanding of real-time object detection. The system uses YOLOv8 and OpenCV to detect and track multiple objects in live video streams, with alerting and event logging features.
Tech: YOLOv8, OpenCV, Python, Deep Learning
I worked on this open-source project to gain hands-on experience in medical image segmentation. Using a U-Net-based model, I segmented tumors in MRI scans, supporting diagnostic workflows.
Tech: U-Net, Medical Imaging, PyTorch
I executed this open-source project to explore fine-grained classification using Vision Transformers (ViT). The pipeline distinguishes between visually similar bird species using transfer learning and data augmentation.
Tech: Vision Transformers, Image Classification, PyTorch
I contributed to this open-source project to enhance my skills in semantic segmentation for autonomous driving. The DeepLabV3+ model identifies road lanes, vehicles, and pedestrians in urban scenes.
Tech: DeepLabV3+, Semantic Segmentation, TensorFlow
I worked on this open-source project to develop a multi-class object detection system for aerial drone imagery using Faster R-CNN. The model detects buildings, vehicles, and infrastructure in high-resolution images.
Tech: Faster R-CNN, Aerial Imagery, Deep Learning
Blockchain-powered gaming platform developed at Cubix. Integrated smart contracts using Solidity to enable secure and transparent in-game transactions. Developed backend APIs with Node.js.
Tech: Blockchain, Smart Contracts, Solidity, Node.js
DetailsDeveloped Python tools for automated web scraping, data processing, and journal formatting. Created interactive data visualizations using D3.js, Plotly, and Matplotlib.
Tech: Python, Web Scraping, Data Visualization
View CodeAndroid app to help students locate hostels in other cities. Developed in Java and XML with funding support from Ignite Pakistan.
Tech: Android App, Java, XML
DetailsAvinash Kumar, Irfanullah Memon, Abdul Sami, Youngwon Jo, Jaeyoung Choi
Electronics, 14(13), 2699, 2025
Read PaperAbdul Sami, Avinash Kumar, Youngwon Jo, Irfanullah Memon, Muhammad Rizwan, Jaeyoung Choi
ICOIN 2025, Chiang Mai, Thailand, 2025
Read PaperYoungwon Jo, Avinash Kumar, Uijong Yang, Daeun Kim, Jaeyoung Choi
Annual Conference on Human and Language Technology, 2024, p 50-55
Read PaperAvinash Kumar, Irfanullah Memon, Abdul Sami, Youngwon Jo, Jaeyoung Choi
SSRN, 2024
View AbstractAvinash Kumar, Kyeolhee Kang, Ammar ul Hassan, Jaeyoung Choi
MDPI Electronics, 2023
Read PaperHyston Kayange, Avinash Kumar, Yejung Lee, Hoonseo Jung, Jongsun Choi
Journal of the Korean Information Science Society, 2023
Read PaperAvinash Kumar, Kyeolhee Kang, Ammar ul Hassan, Jaeyoung Choi
MITA 2023, Technical University of Ostrava, Czech Republic, 2023
Read Paper2025.04.11 | Patent Application (10-2025-0047215, submitted)
Authors: Jayoung Choi, Irfanuulah Memon, Avinash Kumar
2025.04.08 | Patent Application (10-2025-0045652, submitted)
Authors: Jayoung Choi, Avinash Kumar
2025.03.18 | Patent Application (10-2025-0034957, submitted)
Authors: Jayoung Choi, Irfanuulah Memon, Avinash Kumar, Youngwon Jo
2025.03.17 | Patent Application (10-2025-0034154, submitted)
Authors: Jayoung Choi, Avinash Kumar, Youngwon Jo
MITA Conference 2023
Recognized for innovative work on font refinement using GANs.
Soongsil University (2022-2024)
Awarded Professor's Scholarship for Master's Degree.
University of Tokyo, Japan (2018)
Selected as top student for participation in a cybersecurity hackathon.
Mehran University of Engineering and Technology (2016-2021)
Higher Education Commission Scholarship for Bachelor's Degree.
Coursera • Issued Dec 2022
Credential ID: UNR55BUM63YP
Show CredentialCoursera • Issued Nov 2022
Credential ID: JTFMNC28NUSP
Show CredentialCoursera • Issued May 2020
Credential ID: YKUGZD7B53NC
Show CredentialGANs, Diffusion Models, Neural Style Transfer, Text-to-Image Synthesis, Image Generation, Font Generation.
Image-to-Image Translation, Document Analysis, Object Detection, Image Classification, Semantic Segmentation.
Text-Conditioned Image Generation, Cross-modal Retrieval, Vision-Language Models.
I am particularly interested in developing novel deep learning techniques for solving challenging problems in Generative AI and Computer Vision. My current research focuses on improving the quality and diversity of generated images, enhancing the performance of generation models, and exploring the intersection of vision and language.