title: "Database of publication abstracts on exoplanets from NASA archive"
date: July 2019
author: "Gerd Graßhoff", "Sabrina Bier"
Collection of abstracts in publications on exoplanets, as collected by NASA. The dataframes are stored JSON prepared for importing as Pandas dataframes.
numpy.int64
consecutive ID of papers.str
identifier of the paper from NASA ADS. It is taken from NASA ADS without chances.str
Digital Object Indentifier of the actual published paper. It is taken from NASA ADS without chances. If the paper has no DOI, this is marked with 'None'.list of str
authors of the paper. If the paper has no authors, this is marked with ['None']. The number of the authors is adjusted to the reference of affiliations.list of lists of str
affiliations of the authors contributing to the paper. Unknown affiliations are marked with ['None'].str
acknowledgements given by the authors of the paper. If the paper has no acknowledgements, this is marked with 'None'. The word "acknowledgement" in all possible variations is strip off.list of str
grants, supporting the work on the paper. If the paper gives no grants, this is marked with an empty list [].str
month and year of publication of the paper, the day of publication is not stated and therefore always 00. The column 'published' has the following format: YYYY-MM-DD. It is taken from NASA ADS without any chances.numpy.int64
year in which the paper was published. It is taken from NASA ADS without chances.str
title of the paper. If the paper has no title or NASA ADS does not provide any title, this is marked with 'None'. It is taken from NASA ADS without chances.str
abstract of the paper. If the paper has no abstract, this is marked with 'None'. It is taken from NASA ADS without chances.list of str
keywords choosen by the authors to describe the paper. If the paper contains no keywords, this is marked with an empty list []. It is taken from NASA ADS without chances.numpy.int64
number of citations of the paper known to NASA ADS. It is taken from NASA ADS without chances.This database is primary for the oder files of prepocessed abstracts (dfExoplanetsNASAabsClear_v1.json) and extracted sentences (dfExoplanetsNASAabsSentences_v1.json).
Covered publication time: 1943 to Dec 2019
link to source (as July 2019): NASA ADS
information: NASA ADS info
documentation: NASA ADS help