Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | elastic_transform: typo fix | Bryan Newbold | 2020-04-02 | 1 | -1/+1 | |
| | ||||||
* | first iteration of web interface | Bryan Newbold | 2020-04-01 | 16 | -0/+767 | |
| | | | | | | | Copied and tweaked from fatcat:python/fatcat_web LICENSE file for this repo is a TODO and will need to match that of fatcat. | |||||
* | pipenv: add Flask, elasticsearch, testing | Bryan Newbold | 2020-04-01 | 2 | -2/+274 | |
| | ||||||
* | start python module directory | Bryan Newbold | 2020-04-01 | 2 | -0/+88 | |
| | ||||||
* | scripts for data-munging fulltext | Bryan Newbold | 2020-04-01 | 2 | -0/+351 | |
| | ||||||
* | elasticsearch schemas | Bryan Newbold | 2020-04-01 | 2 | -0/+270 | |
| | ||||||
* | helper for renaming files with extensions | Bryan Newbold | 2020-04-01 | 1 | -0/+8 | |
| | ||||||
* | move scripts/ to bin/ | Bryan Newbold | 2020-04-01 | 4 | -1/+200 | |
| | ||||||
* | update missing notes and commands | Bryan Newbold | 2020-04-01 | 2 | -5/+41 | |
| | ||||||
* | add user-agent to deliver_file2disk | Bryan Newbold | 2020-03-30 | 1 | -1/+3 | |
| | ||||||
* | deliver_file2disk: fewer retries, TooManyRedirects | Bryan Newbold | 2020-03-30 | 1 | -1/+3 | |
| | ||||||
* | update gitignore | Bryan Newbold | 2020-03-30 | 2 | -0/+7 | |
| | ||||||
* | update wanfang scrape | Bryan Newbold | 2020-03-30 | 2 | -1/+9 | |
| | ||||||
* | missing: patching metadata for missing fatcat records | Bryan Newbold | 2020-03-30 | 1 | -0/+23 | |
| | ||||||
* | update commands for 2020-03-27 dump; add esbulk ingest | Bryan Newbold | 2020-03-30 | 1 | -7/+16 | |
| | ||||||
* | add README.md | Bryan Newbold | 2020-03-30 | 1 | -0/+25 | |
| | ||||||
* | first iteration of CNKI and Wanfang scrapers | Bryan Newbold | 2020-03-29 | 3 | -0/+131 | |
| | ||||||
* | commands (2020-03-20 version) | Bryan Newbold | 2020-03-27 | 1 | -0/+29 | |
| | ||||||
* | pipenv: add pipfile | Bryan Newbold | 2020-03-24 | 2 | -0/+41 | |
| | ||||||
* | move and tweak scripts | Bryan Newbold | 2020-03-24 | 2 | -23/+7 | |
| | ||||||
* | notes on missing papers | Bryan Newbold | 2020-03-24 | 1 | -0/+282 | |
| | ||||||
* | commit CORD19 munging scripts | Bryan Newbold | 2020-03-23 | 3 | -0/+356 | |
| | ||||||
* | init repo with early notes | Bryan Newbold | 2020-03-23 | 3 | -0/+99 | |