Publications
Below is a comprehensive list of my research publications, including journal articles, conference papers, and preprints.
CLoRA: A Contrastive Approach to Compose Multiple LoRA Models
CLoRA is a training-free, test-time method that uses contrastive learning to compose multiple concept and style LoRAs simultaneously.
ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features
Without requiring additional training, ConceptAttention repurposes the parameters of DiT attention layers to produce highly contextualized concept embeddings. Its key finding is that performing linear projections in the output space of DiT attention layers yields significantly sharper saliency maps than the commonly used cross-attention mechanisms.
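As a rough illustration of that finding, the minimal sketch below (not the paper's code; the tensor shapes and the softmax over concepts are my assumptions) projects image-token outputs of a DiT attention layer onto concept-token outputs to obtain per-concept saliency maps:

```python
import torch

def concept_saliency_from_attention_outputs(image_out, concept_out):
    """Illustrative sketch: build saliency maps by projecting image tokens onto
    concept embeddings in the *output* space of a DiT attention layer, rather
    than reading cross-attention maps.

    image_out:   (num_image_tokens, d) attention-layer outputs for image tokens
    concept_out: (num_concepts, d)     attention-layer outputs for concept tokens
    returns:     (num_concepts, num_image_tokens) saliency maps
    """
    # Linear projection in the attention output space: one score per
    # (concept, image-token) pair.
    scores = concept_out @ image_out.T            # (C, N)
    # Softmax over concepts softly assigns each image token to a concept.
    return torch.softmax(scores, dim=0)

# Toy usage with random features standing in for real DiT activations.
img = torch.randn(256, 64)   # e.g. 16x16 image patches, 64-dim outputs
cpt = torch.randn(3, 64)     # e.g. embeddings for "cat", "sky", "grass"
print(concept_saliency_from_attention_outputs(img, cpt).shape)  # (3, 256)
```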
CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models
Images produced by text-to-image diffusion models do not always faithfully reflect the semantic intent of the text prompt: the model may overlook, or entirely fail to produce, certain objects. While recent studies propose various solutions, they often require functions tailored to each specific failure mode, leading to sub-optimal results, especially for complex prompts. Our work introduces a novel perspective by tackling this challenge in a contrastive context. Our approach intuitively promotes the segregation of objects in attention maps while keeping pairs of related attributes close to each other.
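To make the contrastive idea concrete, here is a minimal sketch (an assumed InfoNCE-style objective, not the paper's exact loss) that treats attention maps of an attribute and its object as positives and maps from different objects as negatives:

```python
import torch
import torch.nn.functional as F

def contrastive_attention_loss(attn_maps, groups, temperature=0.07):
    """Illustrative sketch: an InfoNCE-style loss over cross-attention maps.
    Maps whose tokens share a group id (e.g. an object and its attribute) are
    positives; maps from other groups are negatives, which pushes different
    objects apart while keeping related attribute-object pairs together.

    attn_maps: (num_tokens, H, W) cross-attention maps for selected tokens
    groups:    list of ints, one group id per token
    """
    feats = F.normalize(attn_maps.flatten(1), dim=1)   # (T, H*W)
    sim = feats @ feats.T / temperature                # (T, T)
    labels = torch.tensor(groups)
    num_tokens = len(labels)
    loss, count = 0.0, 0
    for i in range(num_tokens):
        others = torch.arange(num_tokens) != i
        positives = (labels == labels[i]) & others
        if not positives.any():
            continue
        # Log-softmax over all other maps; positive maps should dominate.
        log_probs = F.log_softmax(sim[i, others], dim=0)
        loss = loss - log_probs[positives[others]].mean()
        count += 1
    return loss / max(count, 1)

# Toy usage: maps for tokens ["red", "book", "yellow", "vase"], grouped so
# each attribute is a positive pair with its object.
maps = torch.rand(4, 16, 16)
print(contrastive_attention_loss(maps, groups=[0, 0, 1, 1]))
```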
MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models
MotionFlow is a training-free method that leverages attention for motion transfer. It can successfully transfer a wide variety of motions, ranging from simple to complex patterns.
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
In this work, we propose the first motion transfer approach for diffusion transformers through Mixture of Score Guidance (MSG), a theoretically grounded framework for motion transfer in diffusion models.
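As a very rough sketch of what mixing score guidance can look like (an assumed classifier-free-guidance-style combination on my part, not the paper's equations), the denoising direction can blend text and motion-reference score estimates:

```python
import torch

def mixture_of_score_guidance(eps_uncond, eps_text, eps_motion,
                              w_text=7.5, w_motion=3.0):
    """Illustrative sketch: combine noise/score predictions so the update
    follows the text prompt while being pulled toward the motion of a
    reference video. All inputs share shape (batch, ch, frames, h, w)."""
    return (eps_uncond
            + w_text * (eps_text - eps_uncond)
            + w_motion * (eps_motion - eps_uncond))

# Toy usage with random noise predictions.
shape = (1, 4, 8, 32, 32)
eps = [torch.randn(shape) for _ in range(3)]
print(mixture_of_score_guidance(*eps).shape)
```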
Conditional Information Gain Trellis
Conditional computing processes an input using only part of the neural network's computational units. Learning to execute parts of a deep convolutional network by routing individual samples has several advantages: it can facilitate the interpretability of the model, reduce the model complexity, and reduce the computational burden during training and inference. Furthermore, if similar classes are routed to the same path, that part of the network learns to discriminate between finer differences, and better classification accuracy can be attained with fewer parameters. Recently, several papers have exploited this idea to select a particular child of a node in a tree-shaped network or to skip parts of a network. In this work, we follow a Trellis-based approach for generating specific execution paths in a deep convolutional neural network. We design routing mechanisms that use differentiable information gain-based cost functions to determine which subset of features in a convolutional layer will be executed. We call our method Conditional Information Gain Trellis (CIGT).
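The sketch below illustrates one way a differentiable information-gain routing objective can be written (an assumed formulation for illustration, not the paper's exact cost function): soft routing probabilities define a joint distribution over routes and classes, and maximizing their mutual information encourages each route to specialize on a subset of classes.

```python
import torch
import torch.nn.functional as F

def information_gain_routing_loss(routing_logits, labels, num_classes):
    """Illustrative sketch: a differentiable information-gain objective for
    sample routing. Soft route probabilities and class labels define a joint
    distribution p(route, class); we maximize the mutual information
    I(route; class) by minimizing its negative.

    routing_logits: (B, num_routes) router outputs for a batch of samples
    labels:         (B,) ground-truth class indices
    """
    p_route = F.softmax(routing_logits, dim=1)            # (B, R)
    onehot = F.one_hot(labels, num_classes).float()       # (B, C)
    joint = p_route.T @ onehot / len(labels)              # p(route, class), (R, C)
    p_r = joint.sum(dim=1, keepdim=True)                  # p(route)
    p_c = joint.sum(dim=0, keepdim=True)                  # p(class)
    eps = 1e-8
    mutual_info = (joint * (torch.log(joint + eps)
                            - torch.log(p_r + eps)
                            - torch.log(p_c + eps))).sum()
    return -mutual_info

# Toy usage: a batch of 32 samples routed among 4 blocks, with 10 classes.
logits = torch.randn(32, 4)
labels = torch.randint(0, 10, (32,))
print(information_gain_routing_loss(logits, labels, num_classes=10))
```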