From 5a4f106a89a22cb737fafcacca717d60363baf2a Mon Sep 17 00:00:00 2001 From: Martin Czygan Date: Thu, 1 Apr 2021 01:01:06 +0200 Subject: update README --- skate/README.md | 13 ++++++++----- 1 file changed, 8 insertions(+), 5 deletions(-) diff --git a/skate/README.md b/skate/README.md index cc7e238..6fa4ae2 100644 --- a/skate/README.md +++ b/skate/README.md @@ -57,16 +57,19 @@ After this step: ### skate-from-unstructured - +Takes a refs file and plucks out identifiers from unstructured field. ### skate-ref-to-release + +Converts a ref document to a release. Part of first run, merging refs and releases. + ### skate-to-doi -### skate-verify +Sanitize DOI in tabular file. + +### skate-verify -Goal: make key extraction and comparisons fast for billions of records on a -single machine to support deduplication work for [fatcat](https://fatcat.wiki) -metadata. +Run various matching and verification algorithms. ## Problem -- cgit v1.2.3