diff options
-rw-r--r-- | README.md | 19 |
1 files changed, 18 insertions, 1 deletions
@@ -9,17 +9,34 @@ all relevant code close: Context: [fatcat](https://fatcat.wiki), "Mellon Grant" (20/21). +We use informal, internal versioning, currently v2, next will be v3. + # Grant related tasks 3/4 phases of the grant contain citation graph related tasks. -* [ ] Link PID or DOI to archived versions +* [x] Link PID or DOI to archived versions + +As of v2, we have linkage between fatcat release entities by doi, pmid, pmcid, arxiv. + * [ ] URLs in corpus linked to best possible timestamp (GWB) * [ ] Harvest all URLs in citation corpus (maybe do a sample first) + +A seed-list (from refs; not from the full-text) is done; need to prepare a crawl and lookups in GWB. + * [ ] Links between records w/o DOI (fuzzy matching) + +As of v2, we do have a fuzzy matching procedure (yielding about 5-10% of the total results). + * [ ] Publication of augmented citation graph, explore data mining, etc. * [ ] Interlinkage with other source, monographs, commercial publications, etc. + +As of v3, we have a minimal linkage with wikipedia. + * [ ] Wikipedia (en) references metadata or archived record + +This is ongoing and should be part of v3. + * [ ] Metadata records for often cited non-scholarly web publications * [ ] Collaborations: I4OC, wikicite |