author: Bryan Newbold <bnewbold@archive.org> 2020-10-01 19:28:17 -0700
committer: Bryan Newbold <bnewbold@archive.org> 2020-10-01 19:33:06 -0700
commit: 480278d04c3e01d8ec649e960355667207976ebf
tree: 4df229d957b3cb3e02a27799aaaaad5203ed327a
parent: b82fdf4dd0e7df6cf7b080f5afcf33618e682a08
update TODO file
-rw-r--r-- TODO.txt 11
1 files changed, 4 insertions, 7 deletions
diff --git a/TODO.txt b/TODO.txt
index 632e187..f159a99 100644
--- a/TODO.txt
+++ b/TODO.txt
@@ -1,11 +1,9 @@
-- add gzip to intermediate files pipeline
-- "counts" target to summarize (to console)
-- calculate/fetch shadow as well (?)
-
 content/pipeline:
-- plan for getting release dumps sorted by work ident
+- continuous update worker from fatcat
+- add gzip to intermediate files pipeline commands
 - parallelize SIM indexing
+- makefile targets for bulk ingest
 cleanups:
 - better typing/annotation of work pipeline
@@ -13,10 +11,9 @@ cleanups:
 - use settings.toml for defaults of CLI args
 ponder:
-- robots.txt
-- some space-holder for missing thumbnails
 - smaller author font size (?)
 - "search inside" phrasing
+- "counts" target to summarize (to console)
 data quality:
 - handle sim_issue items with multiple issues in single item (eg, issue="3-4")