| Saturday November 15 @14:00 UTC+1: Room Wichernhaus "zenith" | ||||
|---|---|---|---|---|
| Speaker | Title | Time | Abstract | Material |
| Gilles Landais | Trustworthy AI (and the Eu AI Act) | 10+5 | Quick report on the European Articial Interlligence act. | |
| Thomas Aynaud (remote) | Software Heritage, project presentation and applications to AI | 20+5 | This talk will provide an overview of what Software Heritage is. We will highlight the data we collect and how we do that. It includes the source code and all the events that can be observed during the development process. We will then explore how to utilize and enrich this information to develop services relevant to AI and the actions we take to create a more open, responsible, and transparent AI. | |
| Thomas Boch | First feedback on implementing MCP server to access CDS services | 15+5 | MCP (Model Context Protocol) is a standard aiming at connecting AI tools to external data sources. We will present the MCP server we developed at CDS, and demonstrate how it allows AI applications to access and consume data from Sesame, MocServer, HiPS2FITS, etc. | |
| François Lanusse (remote) | Make astronomical data AI-ready across surveys: Lessons-learned from the Multimodal Universe Project | 15+5 | Deep Learning has seen a recent shift in paradigm, from training specialized models on dedicated datasets, to so-called Foundation Models, trained in a self-supervised manner on vast amounts of data and then adapted to solve specific tasks with state-of-the-art performance. This new paradigm has been exceptionally successful not only for large language models (LLMs) but in other domains such as vision models. However applications of this new approach in astrophysics are still very scarce, for reasons ranging from new architectures to the (surprising) lack of availability of suitable large scale datasets. In this talk, I will discuss our work on building the Multimodal Universe Dataset, the first attempt at building a framework for homogenizing data across many astronomical surveys specifically for AI training purposes. Besides discussing the technical solution we adopted, I will highlight some of the outstanding challenges to enable cross-matching across surveys and streaming data in a way that is compatible with large AI model training. Project page: https://github.com/MultimodalUniverse/MultimodalUniverse |
|
| All | Discussions | 10 | ||
| I | Attachment | History | Action | Size | Date | Who | Comment |
|---|---|---|---|---|---|---|---|
| |
trustworthAI.pdf | r3 r2 r1 | manage | 478.3 K | 2025-11-14 - 20:34 | GillesLandais |