author: Bryan Newbold <bnewbold@robocracy.org> 2018-03-19 21:25:33 -0700
committer: Bryan Newbold <bnewbold@robocracy.org> 2018-03-19 21:25:33 -0700
commit: 3e93149d8a0fb979f6f678ccfd13809d86c803a6
tree: 1cc8416d5a770d317e07e61547d0be9017b93682
parent: 9dcebe3a0dd8cb2a4a31de29b757e9950a2399b4
(old) notes
Diffstat (limited to 'rfc.md')
-rw-r--r-- rfc.md 18
1 files changed, 15 insertions, 3 deletions
diff --git a/rfc.md b/rfc.md
index fd9397ad..1b63a31a 100644
--- a/rfc.md
+++ b/rfc.md
@@ -254,12 +254,15 @@ are:
name
open-access policy
peer-review policy
+ <has> aliases, acronyms
+ <about> subject/category
<has> identifier
<published in> container
<published-by> publisher
publisher
name
+ <has> aliases, acronyms
<has> identifier
## Controlled Vocabularies
@@ -271,8 +274,10 @@ have more controlled editing workflow... perhaps versioned in the codebase:
- identifier namespaces (DOI, ISBN, ISSN, ORCID, etc)
- subject categorization
- license and open access status
-- work types
+- work "types" (article vs. book chapter vs. proceeding, etc)
- contributor types (author, translator, illustrator, etc)
+- human languages
+- file mimetypes
## Unresolved Questions
@@ -309,12 +314,19 @@ I see a tension between focus and scope creep. If a central database like
fatcat doesn't support enough fields and metadata, then it will not be possible
to completely import other corpuses, and this becomes "yet another" partial
bibliographic database. On the other hand, accepting arbitrary data leads to
-other problems:
+other problems: sparseness increases (we have more "partial" data), potential
+for redundancy is high, humans will start editing content that might be
+bulk-replaced, etc.
+
+There might be a need to support "stub" references between entities. Eg, when
+adding citations from PDF extraction, the cited works are likely to be
+ambiguous. Could create "stub" works to be merged/resolved later, or could
+leave the citation hanging. Same with authors, containers (journals), etc.
## References and Previous Work
The closest overall analog of fatcat is [MusicBrainz][mb], a collaboratively
-edited music database. [Open Library][] is a very similar existing service,
+edited music database. [Open Library][ol] is a very similar existing service,
which exclusively contains book metadata.
[Wikidata][wd] seems to be the most successful and actively edited/developed