Deep Learning Approaches for Text-to-Image Generation: Progress and Performance Analysis

Authors

  • Dr. SANJAY E. PATE Assistant Professor, Department of Computer Science, Nanasaheb Yashwantrao Narayanrao Chavan Arts, Science & Commerce College, Chalisgaon.Dist.Jalgaon, India Author

DOI:

https://doi.org/10.70849/IJSCI

Keywords:

Text-to-image, Diffusion models, GANs, DALL·E, Imagen, Natural language processing, Image synthesis

Abstract

Synthesizing images from natural language descriptions is the subject of the quickly developing artificial intelligence field known as "text-to-image generation." This study examines the latest developments in deep learning techniques, with a focus on Generative Adversarial Networks (GANs) and diffusion models like Imagen, GLIDE, and DALL•E. The model architectures, training datasets, and evaluation measures are all thoroughly compared. The paper also emphasizes the limits of existing systems and real-world applications. We wrap up with recommendations for future study, highlighting the necessity of increased fidelity, controllability, and ethical considerations when deploying models.

Downloads

Published

01-08-2025

How to Cite

[1]
Dr. SANJAY E. PATE, “Deep Learning Approaches for Text-to-Image Generation: Progress and Performance Analysis”, Int. J. Sci. Inno. Eng., vol. 2, no. 8, pp. 1–4, Aug. 2025, doi: 10.70849/IJSCI.