Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | handle ext_ids without _id in release schema | Bryan Newbold | 2020-04-09 | 1 | -4/+7 |
| | |||||
* | attempt somewhat more robust abstract cleaning | Bryan Newbold | 2020-04-09 | 1 | -7/+4 |
| | | | | | | Note: there is still a security and robustness issue here in that highlights are marked "safe". Should come up with a better mechanism for escaping/safing. | ||||
* | transform: remove more tags from abstracts | Bryan Newbold | 2020-04-09 | 1 | -1/+1 |
| | |||||
* | transform hacks for new fatcat documents | Bryan Newbold | 2020-04-09 | 1 | -1/+16 |
| | |||||
* | small search tweaks and fixes | Bryan Newbold | 2020-04-08 | 1 | -1/+1 |
| | |||||
* | special-case arxiv/medrxiv/biorxiv container names | Bryan Newbold | 2020-04-08 | 1 | -0/+11 |
| | |||||
* | transform: try to cleanup abstracts | Bryan Newbold | 2020-04-08 | 1 | -3/+31 |
| | |||||
* | include ia_pdf_url when available | Bryan Newbold | 2020-04-03 | 1 | -0/+4 |
| | |||||
* | fixes from prod | Bryan Newbold | 2020-04-03 | 1 | -2/+3 |
| | |||||
* | refactor elastic transform into CLI tool | Bryan Newbold | 2020-04-03 | 1 | -0/+204 |