Commit message | Author | Age | Files | Lines (-/+)
...
* elastic_transform: typo fix | Bryan Newbold | 2020-04-02 | 1 | -1/+1
* first iteration of web interface | Bryan Newbold | 2020-04-01 | 16 | -0/+767
    Copied and tweaked from fatcat:python/fatcat_web. LICENSE file for this repo is a TODO and will need to match that of fatcat.
* pipenv: add Flask, elasticsearch, testing | Bryan Newbold | 2020-04-01 | 2 | -2/+274
* start python module directory | Bryan Newbold | 2020-04-01 | 2 | -0/+88
* scripts for data-munging fulltext | Bryan Newbold | 2020-04-01 | 2 | -0/+351
* elasticsearch schemas | Bryan Newbold | 2020-04-01 | 2 | -0/+270
* helper for renaming files with extensions | Bryan Newbold | 2020-04-01 | 1 | -0/+8
* move scripts/ to bin/ | Bryan Newbold | 2020-04-01 | 4 | -1/+200
* update missing notes and commands | Bryan Newbold | 2020-04-01 | 2 | -5/+41
* add user-agent to deliver_file2disk | Bryan Newbold | 2020-03-30 | 1 | -1/+3
* deliver_file2disk: fewer retries, TooManyRedirects | Bryan Newbold | 2020-03-30 | 1 | -1/+3
* update gitignore | Bryan Newbold | 2020-03-30 | 2 | -0/+7
* update wanfang scrape | Bryan Newbold | 2020-03-30 | 2 | -1/+9
* missing: patching metadata for missing fatcat records | Bryan Newbold | 2020-03-30 | 1 | -0/+23
* update commands for 2020-03-27 dump; add esbulk ingest | Bryan Newbold | 2020-03-30 | 1 | -7/+16
* add README.md | Bryan Newbold | 2020-03-30 | 1 | -0/+25
* first iteration of CNKI and Wanfang scrapers | Bryan Newbold | 2020-03-29 | 3 | -0/+131
* commands (2020-03-20 version) | Bryan Newbold | 2020-03-27 | 1 | -0/+29
* pipenv: add pipfile | Bryan Newbold | 2020-03-24 | 2 | -0/+41
* move and tweak scripts | Bryan Newbold | 2020-03-24 | 2 | -23/+7
* notes on missing papers | Bryan Newbold | 2020-03-24 | 1 | -0/+282
* commit CORD19 munging scripts | Bryan Newbold | 2020-03-23 | 3 | -0/+356
* init repo with early notes | Bryan Newbold | 2020-03-23 | 3 | -0/+99