Probably start with: crossref (including citations) arxiv medline then merge in: dblp CORE oaDOI archive.org paper/url manifest semantic scholar and later: opencitations openlibrary