Commit message                                            Author         Age         Files  Lines
* elastic_transform: typo fix                             Bryan Newbold  2020-04-02  1      -1/+1
* first iteration of web interface                        Bryan Newbold  2020-04-01  16     -0/+767
* pipenv: add Flask, elasticsearch, testing               Bryan Newbold  2020-04-01  2      -2/+274
* start python module directory                           Bryan Newbold  2020-04-01  2      -0/+88
* scripts for data-munging fulltext                       Bryan Newbold  2020-04-01  2      -0/+351
* elasticsearch schemas                                   Bryan Newbold  2020-04-01  2      -0/+270
* helper for renaming files with extensions               Bryan Newbold  2020-04-01  1      -0/+8
* move scripts/ to bin/                                   Bryan Newbold  2020-04-01  4      -1/+200
* update missing notes and commands                       Bryan Newbold  2020-04-01  2      -5/+41
* add user-agent to deliver_file2disk                     Bryan Newbold  2020-03-30  1      -1/+3
* deliver_file2disk: fewer retries, TooManyRedirects      Bryan Newbold  2020-03-30  1      -1/+3
* update gitignore                                        Bryan Newbold  2020-03-30  2      -0/+7
* update wanfang scrape                                   Bryan Newbold  2020-03-30  2      -1/+9
* missing: patching metadata for missing fatcat records   Bryan Newbold  2020-03-30  1      -0/+23
* update commands for 2020-03-27 dump; add esbulk ingest  Bryan Newbold  2020-03-30  1      -7/+16
* add README.md                                           Bryan Newbold  2020-03-30  1      -0/+25
* first iteration of CNKI and Wanfang scrapers            Bryan Newbold  2020-03-29  3      -0/+131
* commands (2020-03-20 version)                           Bryan Newbold  2020-03-27  1      -0/+29
* pipenv: add pipfile                                     Bryan Newbold  2020-03-24  2      -0/+41
* move and tweak scripts                                  Bryan Newbold  2020-03-24  2      -23/+7
* notes on missing papers                                 Bryan Newbold  2020-03-24  1      -0/+282
* commit CORD19 munging scripts                           Bryan Newbold  2020-03-23  3      -0/+356
* init repo with early notes                              Bryan Newbold  2020-03-23  3      -0/+99