About

Learn more about me

Amir Kasaei

Biography

I’m a Master’s student in Computer Software Engineering at Sharif University of Technology. Alongside my studies, I’m actively involved in research as a Research Assistant at the RIML lab, where I focus on Compositional Visual Generation problems in Machine Learning. This allows me to blend theory with real-world applications, pushing the boundaries of what AI can create. My research centers around teaching machines to generate images by combining different elements in creative ways—a field that merges AI, visual computation, and problem-solving. I’m passionate about finding innovative solutions and advancing what AI can achieve. I love collaborating on projects that challenge me to think differently, and I’m always open to connecting with others who share an interest in machine learning, AI, or tech in general.

Research Interests

  • Computer vision
  • Compositional Generation
  • Text-to-image Generation
  • Diffusion Models
  • Visual Reasoning
  • Multimodal Models

Projects

Licenses & Certifications

Skills

Languages

Experiences

Interests

Education

My Education

University

Master of Science in Computer Software Enigneering

September 2023 - Present

Sharif University of Technology

GPA: 4

Grade: 18.68

Bachelor of Science in Computer Enigneering

September 2019 - July 2023

University of Guilan

GPA: 3.98

Grade: 19.58

School

High School

September 2015 - July 2019

Dr. Moein

Field: Mathematics & Physics

GPA: 4

Grade: 19.94

Middle School

September 2012 - July 2015

Begher Al'Olum

GPA: 4

Grade: 20

Honors & Awards

My Honors & Awards

Ranked 1st among B.Sc. Computer Engineering Students (Certificate in Persian)
Sep 2019 – Aug 2023
M.Sc of Software Engineering Admission at Sharif University of Technology as an Exceptional Talented Student
Sep 2023
Tuition Waiver, Bachelor of Science, University of Guilan
Sep 2023

Publications

My Publications

Google Scholar

CARINOX: Inference-Time Scaling with Category-Aware Reward-Based Initial Noise Optimization and Exploration

Seyed Amir Kasaei, Ali Aghayari, Arash Marioriyad, Niki Sepasian, Shayan Baghayi Nejad, MohammadAmin Fazli, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban

Under review at Transactions on Machine Learning Research (TMLR).
A reward-guided optimization framework for initial noise in diffusion models, combining discrete exploration and continuous refinement for compositional alignment.

arXiv, 2025 arXiv:2509.17458

Evaluating the Evaluators: Metrics for Compositional Text-to-Image Generation

Seyed Amir Kasaei, Ali Aghayari, Arash Marioriyad, Niki Sepasian, MohammadAmin Fazli, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban

NeurIPS 2025 Workshop GenProCC: Generative and Protective AI for Content Creation.
A correlation study of compositional text-to-image metrics, benchmarking category-specific and cross-category agreement with human evaluations.

arXiv, 2025 arXiv:2509.21227

Hallucination as an Upper Bound: A New Perspective on Text-to-Image Evaluation

Seyed Amir Kasaei, Mohammad Hossein Rohban

NeurIPS 2025 Workshop GenProCC: Generative and Protective AI for Content Creation.
Interprets hallucination as an upper bound for assessing semantic faithfulness in text-to-image systems.

arXiv, 2025 arXiv:2509.21257

Skills

My Skills

Fields

Compositional Text-to Image Generation

  • Research Assistant at RIML Lab
  • Deep Learning Course at Sharif University of Technology

Deep Learning

  • Deep Learning Course at Sharif University of Technology
  • Deep Learning Course at University of Guilan
  • Deep Learning Specialization at DeepLearning.AI
  • Machine Learning Specialization at DeepLearning.AI

Web Developement

  • HTML
  • CSS
  • PHP
  • MYSQL

Photography

  • Artworks here at

Programming Languages & Libraries

Python

Tensorflow

PyTorch icon

Pytorch

Java

C++

HTML

CSS

Bootstrap

PHP

MYSQL

VHDL

Experiences

My Experiences

Research Assistant

Research Assistant

September 2023 - Present

RIML Lab (Sharif Univeristy of Technology)

Supervisor: Dr M H Rohban

Skills: Machine Learning . Deep Learning . NLP . Computer Vision . Compositional Generation . Medical Image Analysis

Research Assistant

September 2022 - Present

Univeristy of Guilan

Supervisor: Dr S A Mirroshandel

Skills: Machine Learning . Deep Learning . NLP . Computer Vision . Machine Translation . Medical Image Analysis

Teaching Assistant

Head Teaching Assistant of Data Mining

September 2025 - January 2026

Instructor: Dr MA Fazli

Skills: Deep Learning · Data Anlaysis · Data Mining

Teaching Assistant of System 2 in AI

February 2025 - June 2025

Instructor: Dr M H Rohban, Dr M Soleymani

Skills: Compositional Generation · Inferenece-Time Optimization · LLM · Reasoning

Teaching Assistant of Deep Learning

February 2025 - June 2025

Instructor: Dr M Soleymani

Skills: Optimization · Diffusion · Image Generation

Teaching Assistant of Intelligent Analysis of Biomedical Images

September 2024 - January 2025

Instructor: Dr M H Rohban

Skills: Deep Learning · Medical Image Analysis · Trustworthy . Interpretability

Certificates

My Certificates

DeepLearningAI

Deep Learning Specialization

DeepLearning.AI

Instructor: Andrew Ng

Aug 2022

DeepLearningAI

Machine Learning Specialization

DeepLearning.AI

Instructor: Andrew Ng

Sep 2022

Univeristy of Alberta

Fundamentals of Reinforcement Learning

University of Alberta

Instructor: Andrew Ng

Aug 2022

Google

Crash Course on Python

Google

Instructor: Christine Rafla

Aug 2022

Contact

Contact Me

My Address

Tehran, Tehran Province, Iran

Social Profiles

Email Me

a.kasaei@me.com

Designed by amirkasaei