Amir Kasaei | Home Page

About

Learn more about me

Biography

I’m a Master’s student in Computer Engineering at Sharif University of Technology. Alongside my studies, I’m actively involved in research as a Research Assistant at the RIML Lab, where I focus on compositional visual generation problems in machine learning. This allows me to blend theory with real-world applications, pushing the boundaries of what AI can create. My research centers on teaching machines to generate images by combining different elements in creative ways, a field that merges AI, visual computation, and problem-solving. I’m passionate about finding innovative solutions and advancing what AI can achieve. I love collaborating on projects that challenge me to think differently, and I’m always open to connecting with others who share an interest in machine learning, AI, or tech in general.

Birthday: 28 May 2001
Website: amirkasaei.com
City: Tehran, Iran

Age: 24
Degree: M.Sc. of Computer Engineering
Email: a.kasaei@me.com

Research Interests

Computer Vision
Compositional Generation

Text-to-image Generation
Diffusion Models

Visual Reasoning
Multimodal Models

Education

My Education

University

Master of Science in Computer Engineering

September 2023 - Present

Sharif University of Technology

GPA: 4

Grade: 18.54

Bachelor of Science in Computer Engineering

September 2019 - July 2023

University of Guilan

GPA: 3.98

Grade: 19.58

School

High School

September 2015 - July 2019

Dr. Moein

Field: Mathematics & Physics

GPA: 4

Grade: 19.94

Middle School

September 2012 - July 2015

Begher Al'Olum

GPA: 4

Grade: 20

Honors & Awards

My Honors & Awards

Ranked 1st among B.Sc. Computer Engineering Students (Certificate in Persian)

Sep 2019 – Aug 2023

Admission to the M.Sc. in Engineering at Sharif University of Technology as an Exceptionally Talented Student

Sep 2023

Publications

My Publications

CARINOX: Inference-Time Scaling with Category-Aware Reward-Based Initial Noise Optimization and Exploration

Seyed Amir Kasaei, Ali Aghayari, Arash Marioriyad, Niki Sepasian, Shayan Baghayi Nejad, MohammadAmin Fazli, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban

Transactions on Machine Learning Research (TMLR), 2026.
A reward-guided optimization framework for initial noise in diffusion models, combining discrete exploration and continuous refinement for compositional alignment.

arXiv:2509.17458 | OpenReview

Evaluating the Evaluators: Metrics for Compositional Text-to-Image Generation

Seyed Amir Kasaei, Ali Aghayari, Arash Marioriyad, Niki Sepasian, MohammadAmin Fazli, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban

NeurIPS 2025 Workshop GenProCC: Generative and Protective AI for Content Creation.
A correlation study of compositional text-to-image metrics, benchmarking category-specific and cross-category agreement with human evaluations.

arXiv:2509.21227 | OpenReview

Weeding Out Bad Seeds: Noise-Robust Unlearning for Text-to-Image Diffusion Models

Arian Komaei Koma, Seyed Amir Kasaei, Ali Aghayari, Aida Aryafar, Matin Ghiasi, Amirhossein Souri, Mohammad Mosayyebi, AmirMahdi Sadeghzadeh, Mohammad Hossein Rohban

Under review at NeurIPS 2026.
Adaptive noise sampling for unlearning in text-to-image diffusion models.

Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning

Seyed Amir Kasaei, Arash Marioriyad, Mahbod Khaleti, MohammadAmin Fazli, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban

ICLR 2026 Workshop: From Human Cognition to AI Reasoning (HCAIR).
RebusBench is a benchmark of 1,164 rebus puzzles for evaluating abstract cognitive visual reasoning in LVLMs.

arXiv:2604.01764 | OpenReview

Erasure or Erosion? Evaluating Compositional Degradation in Unlearned Text-To-Image Diffusion Models

Arian Komaei Koma, Seyed Amir Kasaei, Ali Aghayari, AmirMahdi Sadeghzadeh, Mohammad Hossein Rohban

CVPR 2026 Workshop on Machine Unlearning for Computer Vision.
Evaluating unlearned diffusion models through the lens of compositionality.

arXiv:2604.04575

Erased but Exploitable: Black-box Embedding-Aware Prompting Against Unlearned Text-to-Image Diffusion Models

Arian Komaei Koma, Seyed Amir Kasaei, Amirhossein Arefzadeh, Roham Izadidoost, AmirMahdi Sadeghzadeh, Mohammad Hossein Rohban

Under review at TMLR.
A black-box embedding-aware adversarial prompting framework for attacking unlearned text-to-image diffusion models.

Hallucination as an Upper Bound: A New Perspective on Text-to-Image Evaluation

Seyed Amir Kasaei, Mohammad Hossein Rohban

NeurIPS 2025 Workshop GenProCC: Generative and Protective AI for Content Creation.
Interprets hallucination as an upper bound for assessing semantic faithfulness in text-to-image systems.

arXiv:2509.21257 | OpenReview

Skills

My Skills

Fields

Compositional Text-to-Image Generation

Research Assistant at RIML Lab
Deep Learning Course at Sharif University of Technology

Deep Learning

Deep Learning Course at Sharif University of Technology
Deep Learning Course at University of Guilan
Deep Learning Specialization at DeepLearning.AI
Machine Learning Specialization at DeepLearning.AI

Web Development

HTML
CSS
PHP
MySQL

Photography

Artworks here at

Programming Languages & Libraries

Python

TensorFlow

PyTorch

Java

C++

HTML

CSS

Bootstrap

PHP

MySQL

VHDL

Experiences

My Experiences

Research Assistant

Research Assistant

September 2023 - Present

RIML Lab (Sharif University of Technology)

Supervisor: Dr M H Rohban

Skills: Machine Learning · Deep Learning · NLP · Computer Vision · Compositional Generation · Medical Image Analysis

Research Assistant

September 2022 - Present

University of Guilan

Supervisor: Dr S A Mirroshandel

Skills: Machine Learning · Deep Learning · NLP · Computer Vision · Machine Translation · Medical Image Analysis

Teaching Assistant

Head Teaching Assistant of Data Mining

September 2025 - January 2026

Instructor: Dr MA Fazli

Skills: Deep Learning · Data Analysis · Data Mining

Teaching Assistant of System 2 in AI

February 2025 - June 2025

Instructor: Dr M H Rohban, Dr M Soleymani

Skills: Compositional Generation · Inference-Time Optimization · LLM · Reasoning

Teaching Assistant of Deep Learning

February 2025 - June 2025

Instructor: Dr M Soleymani

Skills: Optimization · Diffusion · Image Generation

Teaching Assistant of Intelligent Analysis of Biomedical Images

September 2024 - January 2025

Instructor: Dr M H Rohban

Skills: Deep Learning · Medical Image Analysis · Trustworthy · Interpretability

Certificates

My Certificates

Deep Learning Specialization

DeepLearning.AI

Instructor: Andrew Ng

Aug 2022

Machine Learning Specialization

DeepLearning.AI

Instructor: Andrew Ng

Sep 2022

Fundamentals of Reinforcement Learning

University of Alberta

Instructors: Martha White, Adam White

Aug 2022

Crash Course on Python

Google

Instructor: Christine Rafla

Aug 2022

Contact

Contact Me

My Address

Tehran, Tehran Province, Iran

Email Me

a.kasaei@me.com
