author | Bryan Newbold <bnewbold@archive.org> | 2021-03-23 21:42:32 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2021-03-23 21:42:32 -0700 |
commit | 5defd444135bc4adb0748b0d2b8c9b88708bdc1a (patch) | |
tree | 599498f0a9ae5a3177d9702c3a7e8b70e39b2b4a /proposals/2020-05-11_overview.md | |
parent | e70e7cff4b5c910405694fb297330507b49937b1 (diff) | |
proposals: add 2021 UI updates, and rename all to have a date in filename
Diffstat (limited to 'proposals/2020-05-11_overview.md')
-rw-r--r-- | proposals/2020-05-11_overview.md | 38 |
1 files changed, 38 insertions, 0 deletions
diff --git a/proposals/2020-05-11_overview.md b/proposals/2020-05-11_overview.md
new file mode 100644
index 0000000..fa8148c
--- /dev/null
+++ b/proposals/2020-05-11_overview.md
@@ -0,0 +1,38 @@
+
+
+Can be multiple releases for each work:
+
+- required: most canonical published version ("version of record", what would be cited)
+    => or, most updated?
+- optional: mostly openly accessible version
+- optional: updated version
+    => errata, corrected version, or retraction
+- optional: fulltext indexed version
+    => might be not the most updated, or no accessible
+
+
+## Initial Plan
+
+Index all fatcat works in catalog.
+
+Always link to a born-digital copy if one is accessible.
+
+Always link to a SIM microfilm copy if one is available.
+
+Use best available fulltext for search. If structured, like TEI-XML, index the
+body text separate from abstracts and references.
+
+
+## Other Ideas
+
+Do fulltext indexing at the granularity of pages, or some other segments of
+text within articles (paragraphs, chapters, sections).
+
+Fatcat already has all of Crossref, Pubmed, Arxiv, and several other
+authoritative metadata sources. But today we are missing a good chunk of
+content, particularly from institutional repositories and CS conferences (which
+don't use identifiers). Also don't have good affiliation or citation count
+coverage, and mixed/poor abstract coverage.
+
+Could use Microsoft Academic Graph (MAG) metadata corpus (or similar) to
+bootstrap with better metadata coverage.
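
Note on the "Initial Plan" item about structured fulltext: one way to index body text separately from abstracts and references is to split a TEI-XML document into three fields before handing it to the search index. The following is only a minimal sketch, not the project's implementation. It assumes a conventional TEI layout (abstract under `profileDesc`, main text under `text/body`, bibliography entries as `biblStruct` under `text/back`); the function name `split_tei_fields`, the output field names, and the sample document are hypothetical.

```python
import xml.etree.ElementTree as ET

# Standard TEI namespace; the element layout used below is an assumption
# about how the structured fulltext is organized.
TEI_URI = "http://www.tei-c.org/ns/1.0"
TEI_NS = {"tei": TEI_URI}


def _text(elem):
    """Flatten all text under an element into one whitespace-normalized string."""
    if elem is None:
        return ""
    # Joining with a space keeps adjacent elements (heading, paragraph) apart;
    # it may also split words that contain inline markup.
    return " ".join(" ".join(elem.itertext()).split())


def split_tei_fields(tei_xml):
    """Return abstract, body, and references as separate fields (hypothetical schema)."""
    root = ET.fromstring(tei_xml)
    abstract = root.find(".//tei:profileDesc/tei:abstract", TEI_NS)
    body = root.find(".//tei:text/tei:body", TEI_NS)
    back = root.find(".//tei:text/tei:back", TEI_NS)
    refs = [] if back is None else list(back.iter("{%s}biblStruct" % TEI_URI))
    return {
        "abstract": _text(abstract),
        "body": _text(body),
        "references": [_text(ref) for ref in refs],
    }


# Tiny made-up document, for illustration only.
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader><profileDesc><abstract><p>Short  abstract.</p></abstract></profileDesc></teiHeader>
  <text>
    <body><div><head>Intro</head><p>Body text here.</p></div></body>
    <back><div><listBibl><biblStruct><analytic><title>Cited Work</title></analytic></biblStruct></listBibl></div></back>
  </text>
</TEI>"""

fields = split_tei_fields(SAMPLE_TEI)
```

Keeping the three fields apart would let a search index weight or filter them independently, for example so that a match in a cited title does not rank the same as a match in the body text.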