Responsive image





SITUATE - Synthetic Object Counting Dataset for VLM training

Peinl, René; Tischler, Vincent; Schröder, Patrick; Groth, Christian (2026)

21st International Conference on Computer Vision Theory and Applications (VISAPP26), Marbella, Spain.


Open Access Peer Reviewed
 

We present SITUATE, a novel dataset designed for training and evaluating Vision Language Models on counting tasks with spatial constraints. The dataset bridges the gap between simple 2D datasets like VLMCountBench and often ambiguous real-life datasets like TallyQA, which lack control over occlusions and spatial composition. Experiments show that our dataset helps to improve generalization for out-of-distribution images, since a finetune of Qwen VL 2.5 7B on SITUATE improves accuracy on the Pixmo count test data, but not vice versa. We cross validate this by comparing the model performance across established other counting benchmarks and against an equally sized fine-tuning set derived from Pixmo count.

mehr

Evaluation of the effect of camera viewing angles on the quality of human pose estimation in river surfing

Zöllner, Michael; Krause, Moritz; Groth, Christian; Kniesburges, Stefan...

iWOAR 2025 - 10th international Workshop on Sensor-Based Activity Recognition and Artificial Intelligence.


Peer Reviewed

Towards a size-aware implicit 2-D cloth rendering

Scharnagl, Bastian; Groth, Christian (2025)

2025 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR).


Peer Reviewed
 

There is a huge demand for trying on clothing at home. Recent methods to capture your figure mostly work in a 2d image plane and despite recent improvements of available technology the simulation of clothing is still not satisfactory. Especially the rendering of different clothing sizes is still a major challenge, which is only addressed by COTTON [1]. We propose an improvement to this approach by adding more control over the image generation process. For this we employ a special type of conditional diffusion model, namely ControlNet, and take keypoints of the fashion as conditional input. (This work is a part of the project (M4-SKI) has been supported and funded by the European Regional Development Fund (ERDF)).

mehr

Enhancing Fitness Visualization: Application and Efficacy of Realistic Inpainting Techniques Using Diffusion Models

Kemnitzer, Jonas; Groth, Christian (2024)

Proceedings of the 2nd International Conference on AI-generated Content 2024.


Peer Reviewed
 

In this paper we present a stable-diffusion based zero-shot approach to realistically transform the image of a

human body into a more fit version of that depicted person. Therefore we combine a modified stable diffusion

model with inpainting techniques and incorporated constraints. We introduce a prototype which allows users to

upload a photo and visualize a more fit version of themselves. We evaluated our approach in various experiments

and focused on the applicability and effectiveness of these techniques, with attention to gender-specific results.

This work contributes to the fields of computer vision and generative AI by demonstrating practical applications

and identifying areas for improvement in realistic body transformation visualizations. (This work is a part of the project (M4-SKI) has been supported and funded by the European Regional Development Fund (ERDF)).


KPNet: Towards a parameterized implicit 2d cloth rendering

Scharnagl, Bastian; Groth, Christian (2024)


Peer Reviewed
 

The simulation of clothing for a virtual try on is still a challenging task, especially if the customer wants to use state of the art technology. To address this, we employ a 2D plane to process customer images. Specifically, we utilize a neural network, notably an autoencoder, to render so called fashion landmarks. As input we use human keypoints that represent the model poses and our fashion landmarks of the clothing from stock photos to generate fashion landmarks in the desired pose. These can be utilized by additional algorithms to adapt the clothing length or width.

This project has been funded by the European Regional Development Fund (EFRE).

mehr

Evaluation of different deep learning approaches for EEG classification

Scharnagl, Bastian; Groth, Christian (2022)

5th International Conference on Artificial Intelligence for Industries (AI4I). 2022.


Peer Reviewed
 

EEG classification is a promising approach to facilitate the life of handicapped people and to generate future human-computer-interfaces. In this paper we want to compare the effectiveness of current state of the art deep learning techniques for EEG classification. Therefore, we applied different approaches on various datasets and did a crosscomparison of the results in order to get more knowledge on the generalization capabilities. Additionally, we created a new EEG dataset and made it available for further research.


Prof. Dr. Christian Groth


Hochschule für Angewandte Wissenschaften Hof

Forschungsgruppe Intelligente und Lernende Systeme (ils)
Alfons-Goppel-Platz 1
95028 Hof

T +49 9281 409-4811
christian.groth[at]hof-university.de