diff options
author | Bryan Newbold <bnewbold@archive.org> | 2020-10-01 19:28:17 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-10-01 19:33:06 -0700 |
commit | 480278d04c3e01d8ec649e960355667207976ebf (patch) | |
tree | 4df229d957b3cb3e02a27799aaaaad5203ed327a /TODO.txt | |
parent | b82fdf4dd0e7df6cf7b080f5afcf33618e682a08 (diff) | |
download | fatcat-scholar-480278d04c3e01d8ec649e960355667207976ebf.tar.gz fatcat-scholar-480278d04c3e01d8ec649e960355667207976ebf.zip |
update TODO file
Diffstat (limited to 'TODO.txt')
-rw-r--r-- | TODO.txt | 11 |
1 files changed, 4 insertions, 7 deletions
@@ -1,11 +1,9 @@ -- add gzip to intermediate files pipeline -- "counts" target to summarize (to console) -- calculate/fetch shadow as well (?) - content/pipeline: -- plan for getting release dumps sorted by work ident +- continuous update worker from fatcat +- add gzip to intermediate files pipeline commands - parallelize SIM indexing +- makefile targets for bulk ingest cleanups: - better typing/annotation of work pipeline @@ -13,10 +11,9 @@ cleanups: - use settings.toml for defaults of CLI args ponder: -- robots.txt -- some space-holder for missing thumbnails - smaller author font size (?) - "search inside" phrasing +- "counts" target to summarize (to console) data quality: - handle sim_issue items with multiple issues in single item (eg, issue="3-4") |