Home
About
News
People
- The Kavli ENSI brings together 23 world-class scientists from a diverse set of backgrounds with a combined interest and focus on learning the fundamentals of how energy flows and can be controlled on the nanoscale, where the rules of quantum mechanics and statistical mechanics teach us that entirely new ways of thinking are possible. The institute has developed rapidly to become a vibrant hub of engagement on these issues across the Berkeley campus and the Berkeley Lab.
  
  Overview
  
  Leadership
  
  Affiliated Faculty
  
  Emeriti Faculty
  
  Fellows
  
  Visiting Scholars
  
  Group Representatives
  
  Staff
Programs
- Strong Partnerships for the Future: Kavli ENSI has implemented numerous programs to increase interdisciplinary collaborations.
  
  Overview
  
  Heising-Simons Junior Fellowship
  
  Kavli ENSI Graduate Student Fellowship
  
  Student Thesis Prize Awards
  
  Kavli ENSI / KENTECH Exchange Program
  
  Winton, Cambridge-Kavli ENSI, Berkeley Exchange Program
  
  Computer Resources
Events
- ‎The Kavli Institute scholars engage the energy nanosciences community in regular forums that contribute to the intellectual agenda of the institute. To this end, we have created several venues and events that will increase interdisciplinary collaborations.
  
  Overview
  
  Research Seminars
  
  Distinguished Lectures
  
  Workshops
  
  10 Year Anniversary
  
  Retreats
  
  Inaugural Events
  
  Symposiums
Publications

Secondary navigation

Structured information extraction from scientific text with large language models

Abstract:

Extracting structured knowledge from scientific text remains a challenging task for machine learning models. Here, we present a simple approach to joint named entity recognition and relation extraction and demonstrate how pretrained large language models (GPT-3, Llama-2) can be fine-tuned to extract useful records of complex scientific knowledge. We test three representative tasks in materials chemistry: linking dopants and host materials, cataloging metal-organic frameworks, and general composition/phase/morphology/application information extraction. Records are extracted from single sentences or entire paragraphs, and the output can be returned as simple English sentences or a more structured format such as a list of JSON objects. This approach represents a simple, accessible, and highly flexible route to obtaining large databases of structured specialized scientific knowledge extracted from research papers.

Author:

Dagdelen, J.

Dunn, A.

Lee, S.

Walker, N.

Rosen, A. S.

Ceder, G.

Persson, K. A.

Jain, A.

Publication date:

February 15, 2024

Publication type:

Journal Article

Document

https://www.nature.com/articles/s41467-024-45563-x

Topics

2024 New's Items topic page