TWiki
>
Main Web
>
InterOpMay2024KD
(revision 3) (raw view)
Edit
Attach
---++ Knowledge Discovery Time: Wednesday May 22, 2024 11:00-12:30 Australian Eastern Standard Time [[https://www.timeanddate.com/worldclock/fixedtime.html?msg=KD&iso=20230509T1100][<br /><br />]] | *Speaker* | *Title* | *Time* | *Abstract* | *Material* | | _Yihan Tao_ | Greetings and Introduction | 5' | | [[%PUBURL%/IVOA/InterOpMay2023KD/slides_session_IVOAInterOp_Bologna_2023.pdf][p]]df | | _Alberto Accomazzi_ | BiblioPile: Building a Dataset to Support AI-enabled Bibliography Curation efforts | 15'+5' | A well-established way to assess the scientific impact of an observational facility in astronomy is the quantitative analysis of the studies published in the literature which have made use of the data taken by the facility. A requirement of such analysis is the creation of bibliographies which annotate and link data products with the literature, thus providing a way to use bibliometrics as an impact measure for the underlying data. An automated assistant able to emulate some of the associated activities would provide a valuable contribution to the human effort involved. LLMs have shown flexibility in interpreting and classifying scientific articles which are the basis for this curation activity. They have also been successfully used for information extraction tasks, which would help identify the specific datasets mentioned in the papers. In this talk I will describe our effort to create the BiblioPile, a contributed dataset consisting of open access fulltext papers and annotated bibliography from institutions that maintain them in order to help train AI/ML bibliographic annotation pipelines. | [[%PUBURL%/IVOA/InterOpMay2023KD/20230509-Kruk-IVOABologna-Exploring_astronomy_data_archives_at_scale_using_deep_learning_and_crowdsourcing.pdf][ ]]pdf | | _Yan Shao_ | Generative Named Entity Normalization for Astronomical Facilities | 15'+5' | Named entity normalization for astronomical facilities is crucial in the related academic research. Unlike the majority of the previous work, we model named entity normalization as a sequence generation problem via utilizing large language models, without assuming a comprehensive set of predefined normalized forms for any entities. Four entity normalization scenarios that are likely to occur in real-world application are discussed specifically, depending on whether the explicit normalization rules as well as the corresponding annotated instances are available. Moreover, we propose respective generative normalization methods and evaluate on datasets compiled from the standard telescope name lists maintained by the American Astronomical Society (AAS) and the Astrophysics Data System (ADS). The empirical findings demonstrate that the analytical, inductive, and generative capabilities of LLMs empower generative entity normalization to achieve commendable performances, even under very stringent conditions. The generative normalization effectively remedies the shortcomings of the retrieval-based methods. | pdf | | _Kai Polsterer_ | | 15'+5' | | pdf | | _Panel + audience<br />(Yihan Tao, Kai Polstererk, Rafael Martinez Galarza, Alberto Accomazzi)_ | Panel-led discussion | 30' | Seeding topics for discussion <p>1. How can state-of-the-art AI technologies, such as LLMs, fundation models and agents enhance the VO?</p> <p>2. What are the potential applications of these AI technologies within the VO framework?</p> <p>3. What are the best practices and strategies for integrating AI agents and models with VO tools and science platforms that can help user efficiently access to and analyse astronomical data? What are the challenges?</p> | | Moderator: [[Raffaele D'Abrusco]], Notetaker: [[IVOA.TBD][TBD]], [[https://yopad.eu/p/IVOA_Nov3_KD][Etherpad link]]
Edit
|
Attach
|
Watch
|
P
rint version
|
H
istory
:
r10
|
r5
<
r4
<
r3
<
r2
|
B
acklinks
|
V
iew topic
|
Raw edit
|
More topic actions...
Topic revision: r3 - 2024-05-16
-
RaffaeleDAbrusco
Main
Log in
or
Register
Main Web
Create New Topic
Index
Search
Changes
Notifications
RSS Feed
Statistics
Preferences
Webs
IVOA
PDL1RFC
PhotDM1
Spectral2
SpectralDM2
Know
Main
Sandbox
TWiki
Copyright © 2008-2025 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback