Multimodal datasets are a critical component in recent breakthroughs such as CLIP, Stable Diffusion and GPT-4, yet their design does not receive the same research attention as model architectures or training algorithms. To address this shortcoming in the machine...
Groundbreaking language-vision architectures like CLIP and DALL-E proved the utility of training on large amounts of noisy image-text data, without relying on the expensive, accurate labels used in standard unimodal supervised learning for vision. The resulting models showed...
Among other topics, the article includes a discussion with Prof. Jürgen Gall on how the EU's AI initiative, announced by Ursula von der Leyen, can help German AI start-ups. It is pointed out that the initiative, which involves the establishment of...
Over-squashing and over-smoothing are two critical issues that limit the capabilities of graph neural networks (GNNs). While over-smoothing eliminates the differences between nodes, making them indistinguishable, over-squashing refers to the inability of GNNs to...
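Over-smoothing in particular is straightforward to reproduce numerically. The sketch below is an illustrative assumption, not code accompanying any of the works above: it applies the common symmetric-normalized propagation with self-loops to random node features on a small example graph and tracks the mean pairwise distance between node representations, which shrinks with depth until nodes become indistinguishable. The graph, feature dimension, and depth are arbitrary choices.

```python
# Minimal over-smoothing demo (illustrative; not from any cited paper).
# Repeated propagation with A_hat = D^{-1/2} (A + I) D^{-1/2} drives node
# features toward each other until nodes are indistinguishable.
import numpy as np

rng = np.random.default_rng(0)

# Small example graph: 6 nodes, edges chosen arbitrarily for illustration.
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (5, 0), (1, 4)]
n = 6
A = np.zeros((n, n))
for i, j in edges:
    A[i, j] = A[j, i] = 1.0

A_tilde = A + np.eye(n)                      # add self-loops
d = A_tilde.sum(axis=1)
A_hat = A_tilde / np.sqrt(np.outer(d, d))    # symmetric normalization

X = rng.standard_normal((n, 8))              # random 8-dim node features

for layer in range(1, 21):
    X = A_hat @ X                            # one linear propagation step
    # mean distance over distinct node pairs (upper triangle only)
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    mean_dist = dist[np.triu_indices(n, k=1)].mean()
    if layer in (1, 5, 10, 20):
        print(f"layer {layer:2d}: mean pairwise distance = {mean_dist:.4f}")
```

Running this prints a mean pairwise distance that decays monotonically with depth, which is the over-smoothing effect described above.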
Our study reveals new theoretical insights into over-smoothing and feature over-correlation in deep graph neural networks. We show the prevalence of invariant subspaces, demonstrating a fixed relative behavior that is unaffected by feature transformations. Our work...
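A standard spectral picture makes the invariant-subspace claim concrete: the dominant eigenvector of the propagation matrix spans a subspace that repeated propagation drives features into, and because propagation commutes with right-multiplication by a feature transformation, that limiting subspace is unaffected by such transformations. The sketch below illustrates this intuition only; it is not the paper's actual construction, and the graph and transform matrix W are hypothetical choices.

```python
# Illustrative sketch (not the paper's construction): node features collapse
# onto the dominant eigenvector subspace of the propagation matrix, and a
# feature transformation W does not change which subspace they end up in,
# since A_hat^k (X W) = (A_hat^k X) W.
import numpy as np

rng = np.random.default_rng(1)

edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 2)]
n = 4
A = np.zeros((n, n))
for i, j in edges:
    A[i, j] = A[j, i] = 1.0
A_tilde = A + np.eye(n)
d = A_tilde.sum(axis=1)
A_hat = A_tilde / np.sqrt(np.outer(d, d))

# Dominant eigenvector of A_hat (eigenvalue 1, proportional to sqrt(degree)).
w, V = np.linalg.eigh(A_hat)
v_dom = V[:, np.argmax(w)]

def residual_outside(X, v):
    """Relative residual after projecting the columns of X onto span{v}
    (0 means X lies entirely inside the subspace)."""
    proj = np.outer(v, v) @ X
    return np.linalg.norm(X - proj) / np.linalg.norm(X)

X = rng.standard_normal((n, 3))
W = rng.standard_normal((3, 3))              # arbitrary feature transform

for k in (1, 10, 50):
    Xk = np.linalg.matrix_power(A_hat, k) @ X
    Xk_W = np.linalg.matrix_power(A_hat, k) @ X @ W
    print(f"k={k:2d}: residual {residual_outside(Xk, v_dom):.4f} "
          f"(with W: {residual_outside(Xk_W, v_dom):.4f})")
```

Both residuals go to zero at the same rate, i.e. the features exhibit a fixed relative behavior in the limit that the feature transformation W cannot alter.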