| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | Revert "pull GROBID refs along with crossref records into bundles" | Bryan Newbold | 2021-11-10 | 1 | -3/+1 |
| | This reverts commit c164970449a392b5165d903d213c2bb51f2a187f. Didn't mean to merge this to master just yet. | | | | |
* | pull GROBID refs along with crossref records into bundles | Bryan Newbold | 2021-11-10 | 1 | -1/+3 |
* | make fmt (black 21.9b0) | Bryan Newbold | 2021-10-27 | 1 | -3/+14 |
* | re-style imports (isort) on all core python files | Bryan Newbold | 2021-10-27 | 1 | -1/+2 |
* | lint fixes, and run fmt | Bryan Newbold | 2021-06-02 | 1 | -3/+1 |
* | add 'crossref' hydration to work pipeline | Bryan Newbold | 2021-06-02 | 1 | -0/+11 |
| | The immediate motivation is to include recent crossref refs in citation graph transforms. May also be valuable for researchers to have authoritative/publisher metadata in the bundle dumps. | | | | |
* | Modernize Python syntax with pyupgrade --py38-plus **/*.py | Christian Clauss | 2021-02-23 | 1 | -1/+1 |
* | add basic html fulltext support to fetch pipeline | Bryan Newbold | 2020-11-18 | 1 | -0/+11 |
* | make fmt | Bryan Newbold | 2020-06-29 | 1 | -1/+3 |
* | fetch pdftotext and pdf_meta from blobs, postgrest | Bryan Newbold | 2020-06-29 | 1 | -0/+9 |
| | This replaces the temporary COVID-19 content hack with production content (text, thumbnail URLs) stored in postgrest and seaweedfs. | | | | |
* | fmt | Bryan Newbold | 2020-06-04 | 1 | -1/+8 |
* | more type annotations and fixes | Bryan Newbold | 2020-06-04 | 1 | -2/+2 |
* | flake8 fixes (partial) | Bryan Newbold | 2020-06-03 | 1 | -1/+0 |
* | reformat python code with black | Bryan Newbold | 2020-06-03 | 1 | -21/+14 |
* | WIP on release-to-sim fetching | Bryan Newbold | 2020-05-19 | 1 | -0/+75 |