Diffstat (limited to 'extra/wikipedia/README.md')
-rw-r--r-- | extra/wikipedia/README.md | 3 |
1 file changed, 2 insertions, 1 deletion
diff --git a/extra/wikipedia/README.md b/extra/wikipedia/README.md
index 8cfdfc0..59480a7 100644
--- a/extra/wikipedia/README.md
+++ b/extra/wikipedia/README.md
@@ -43,7 +43,8 @@ Within a virtualenv, use `parallel` to process like:

 This will output JSON lines, one line per article, with the article title,
 revision, site name, and any extracted references in a sub-array (of JSON
-objects).
+objects). As of December 2021, it takes about 17 hours on a large machine, with
+the above command.

 ## Prior Work