Commit message | Author | Date | Files | Lines
* refactor enrich into fatcat_covid19 | Bryan Newbold | 2020-04-03 | 4 | -109/+83
* refactor elastic transform into CLI tool | Bryan Newbold | 2020-04-03 | 2 | -26/+21
* refactor derivatives into CLI tool | Bryan Newbold | 2020-04-03 | 2 | -66/+61
* document scripts and tools a bit | Bryan Newbold | 2020-04-03 | 8 | -0/+32
* add LICENSE and CONTRIBUTORS files | Bryan Newbold | 2020-04-03 | 3 | -0/+694
* search: remove debugging statement | Bryan Newbold | 2020-04-03 | 1 | -1/+1
* tweak fulltext ES schema | Bryan Newbold | 2020-04-03 | 2 | -41/+62
* UI tweaks, i18n prep | Bryan Newbold | 2020-04-03 | 14 | -172/+155
* update gitignore | Bryan Newbold | 2020-04-03 | 1 | -1/+4
* uwsgi config file | Bryan Newbold | 2020-04-02 | 1 | -0/+14
* add an example .env file | Bryan Newbold | 2020-04-02 | 1 | -0/+4
* WIP: top-level helper script | Bryan Newbold | 2020-04-02 | 1 | -0/+55
    Goal is to refactor most other covid19-specific commands into this tool, with code living in the fatcat_covid19 module.
* include container_original_name ES field | Bryan Newbold | 2020-04-02 | 2 | -0/+2
* basic fulltext search highlighting | Bryan Newbold | 2020-04-02 | 4 | -16/+58
* transform: include missing external idents | Bryan Newbold | 2020-04-02 | 1 | -0/+4
* webface: refactoring of style | Bryan Newbold | 2020-04-02 | 5 | -77/+140
* flask: can't flash() without cookies | Bryan Newbold | 2020-04-02 | 1 | -1/+1
* elastic_transform: typo fix | Bryan Newbold | 2020-04-02 | 1 | -1/+1
* first iteration of web interface | Bryan Newbold | 2020-04-01 | 16 | -0/+767
    Copied and tweaked from fatcat:python/fatcat_web
    LICENSE file for this repo is a TODO and will need to match that of fatcat.
* pipenv: add Flask, elasticsearch, testing | Bryan Newbold | 2020-04-01 | 2 | -2/+274
* start python module directory | Bryan Newbold | 2020-04-01 | 2 | -0/+88
* scripts for data-munging fulltext | Bryan Newbold | 2020-04-01 | 2 | -0/+351
* elasticsearch schemas | Bryan Newbold | 2020-04-01 | 2 | -0/+270
* helper for renaming files with extensions | Bryan Newbold | 2020-04-01 | 1 | -0/+8
* move scripts/ to bin/ | Bryan Newbold | 2020-04-01 | 4 | -1/+200
* update missing notes and commands | Bryan Newbold | 2020-04-01 | 2 | -5/+41
* add user-agent to deliver_file2disk | Bryan Newbold | 2020-03-30 | 1 | -1/+3
* deliver_file2disk: fewer retries, TooManyRedirects | Bryan Newbold | 2020-03-30 | 1 | -1/+3
* update gitignore | Bryan Newbold | 2020-03-30 | 2 | -0/+7
* update wanfang scrape | Bryan Newbold | 2020-03-30 | 2 | -1/+9
* missing: patching metadata for missing fatcat records | Bryan Newbold | 2020-03-30 | 1 | -0/+23
* update commands for 2020-03-27 dump; add esbulk ingest | Bryan Newbold | 2020-03-30 | 1 | -7/+16
* add README.md | Bryan Newbold | 2020-03-30 | 1 | -0/+25
* first iteration of CNKI and Wanfang scrapers | Bryan Newbold | 2020-03-29 | 3 | -0/+131
* commands (2020-03-20 version) | Bryan Newbold | 2020-03-27 | 1 | -0/+29
* pipenv: add pipfile | Bryan Newbold | 2020-03-24 | 2 | -0/+41
* move and tweak scripts | Bryan Newbold | 2020-03-24 | 2 | -23/+7
* notes on missing papers | Bryan Newbold | 2020-03-24 | 1 | -0/+282
* commit CORD19 munging scripts | Bryan Newbold | 2020-03-23 | 3 | -0/+356
* init repo with early notes | Bryan Newbold | 2020-03-23 | 3 | -0/+99