aboutsummaryrefslogtreecommitdiffstats
path: root/notes
Commit message (Collapse)AuthorAgeFilesLines
* more unpaywall ingest notesBryan Newbold2020-03-051-0/+416
|
* update (and move) ingest notesBryan Newbold2020-03-036-0/+480
|
* ingest backfill notesBryan Newbold2020-02-243-0/+150
|
* jan 2020 bulk ingest notesBryan Newbold2020-02-121-0/+26
|
* add notes on recent ingest and backfill tasksBryan Newbold2020-02-053-0/+221
|
* hadoop job log rename and updateBryan Newbold2019-12-271-0/+25
|
* update job log with pig runsBryan Newbold2019-12-261-0/+10
|
* updated re-GROBID job log entryBryan Newbold2019-11-151-0/+31
|
* ingest/backfill notesBryan Newbold2019-11-133-0/+47
|
* notes about running 'regrobid' batches manually (not kafka)Bryan Newbold2019-11-131-0/+41
|
* commit old notes about munging GROBID outputBryan Newbold2019-11-131-0/+70
|
* old groupworks job logBryan Newbold2019-09-201-0/+8
|
* petabox journal files ingest updatesBryan Newbold2019-06-201-0/+25
|
* clearer CDX munge notesBryan Newbold2019-05-091-1/+1
|
* give sort way more RAM by defaultBryan Newbold2019-02-013-6/+6
|
* match_filter_enrich notesBryan Newbold2019-01-031-0/+12
|
* notes on file-level metadata dumpBryan Newbold2018-12-191-0/+31
|
* update notesBryan Newbold2018-12-101-1/+14
|
* match_filter_enrich: fix typoBryan Newbold2018-09-221-1/+1
|
* match and enrich notes+scriptBryan Newbold2018-09-141-0/+19
|
* crude job stats/metrics in a text fileBryan Newbold2018-08-271-0/+95
|
* update TODOBryan Newbold2018-08-241-0/+10
|
* commit notes from my laptopBryan Newbold2018-08-246-0/+256