author: Martin Czygan <martin.czygan@gmail.com> 2021-04-01 17:46:23 +0200
committer: Martin Czygan <martin.czygan@gmail.com> 2021-04-01 17:46:23 +0200
commit: d6b3a1d229339876b07c58cf8492584180536262
tree: 41e43f5e21f92ff24bcfd6f3e276624587f6badd /README.md
parent: 513acd4fc15c6164cf292fa2065c38611a77317d
update README
Diffstat (limited to 'README.md')
-rw-r--r-- README.md 19
1 files changed, 14 insertions, 5 deletions
diff --git a/README.md b/README.md
index eb8461d..532eda2 100644
--- a/README.md
+++ b/README.md
@@ -17,29 +17,38 @@ We use informal, internal versioning, currently v2, next will be v3.
* [x] Link PID or DOI to archived versions
-As of v2, we have linkage between fatcat release entities by doi, pmid, pmcid, arxiv.
+> As of v2, we have linkage between fatcat release entities by doi, pmid, pmcid, arxiv.
* [ ] URLs in corpus linked to best possible timestamp (GWB)
* [ ] Harvest all URLs in citation corpus (maybe do a sample first)
-A seed-list (from refs; not from the full-text) is done; need to prepare a crawl and lookups in GWB.
+> A seed-list (from refs; not from the full-text) is done; need to prepare a crawl and lookups in GWB.
* [ ] Links between records w/o DOI (fuzzy matching)
-As of v2, we do have a fuzzy matching procedure (yielding about 5-10% of the total results).
+> As of v2, we do have a fuzzy matching procedure (yielding about 5-10% of the total results).
* [ ] Publication of augmented citation graph, explore data mining, etc.
* [ ] Interlinkage with other source, monographs, commercial publications, etc.
-As of v3, we have a minimal linkage with wikipedia.
+> As of v3, we have a minimal linkage with wikipedia.
* [ ] Wikipedia (en) references metadata or archived record
-This is ongoing and should be part of v3.
+> This is ongoing and should be part of v3.
* [ ] Metadata records for often cited non-scholarly web publications
* [ ] Collaborations: I4OC, wikicite
+We attended an online workshop in 09/2020, organized in part by OCI members;
+recording: [fatcat five minute
+intro](https://archive.org/details/fatcat_workshop_open_citations_open_scholarly_metadata_2020)
+
+# TODO
+
+* [ ] create a first index, ES7 [schema PR](https://git.archive.org/webgroup/fatcat/-/merge_requests/99)
+* [ ] build API, [spec notes](https://git.archive.org/webgroup/fatcat/-/blob/10eb30251f89806cb7a0f147f427c5ea7e5f9941/proposals/2021-01-29_citation_api.md)
+
# IA Use Cases
* [ ] discovery tool, e.g. "cited by ..." link