Commit message | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
| * | ES release: last minor tweaks | Bryan Newbold | 2020-02-26 | 1 | -2/+2 | |
| | | ||||||
| * | ES files: don't remove archive.org domains/hosts | Bryan Newbold | 2020-02-07 | 1 | -5/+0 | |
| | | ||||||
| * | ES releases: host/domain fixes | Bryan Newbold | 2020-01-31 | 1 | -2/+2 | |
| | | ||||||
| * | fix release es transform missing 'issue' | Bryan Newbold | 2020-01-30 | 1 | -0/+1 | |
| | | ||||||
| * | add upper-case work-around from kibana map join | Bryan Newbold | 2020-01-30 | 1 | -0/+1 | |
| | | ||||||
| * | tweak file ES archive.org domain tracking | Bryan Newbold | 2020-01-30 | 1 | -0/+6 | |
| | | ||||||
| * | implement host+domain parsing for file ES transform | Bryan Newbold | 2020-01-30 | 1 | -9/+5 | |
| | | ||||||
| * | fix ES file schema plural field names | Bryan Newbold | 2020-01-29 | 1 | -4/+3 | |
| | | ||||||
| * | elastic schema fixes | Bryan Newbold | 2020-01-29 | 1 | -0/+5 | |
| | | ||||||
| * | add country to v03b release schema | Bryan Newbold | 2020-01-29 | 1 | -0/+2 | |
| | | ||||||
| * | actually implement changelog transform | Bryan Newbold | 2020-01-29 | 1 | -17/+45 | |
| | | ||||||
| * | fix some transform bugs, add some tests | Bryan Newbold | 2020-01-29 | 1 | -6/+8 | |
| | | ||||||
| * | ES release schema updates | Bryan Newbold | 2020-01-29 | 1 | -5/+76 | |
| | | ||||||
| * | container ES schema changes | Bryan Newbold | 2020-01-29 | 1 | -16/+18 | |
| | | ||||||
| * | first implementation of ES file schema | Bryan Newbold | 2020-01-29 | 2 | -1/+46 | |
| | | Includes a trivial test and transform, but not any workers or doc updates. | |||||
* | | default to PMC ingest URLs over DOI | Bryan Newbold | 2020-02-04 | 1 | -4/+4 | |
|/ | For cases where there might be both PMC and DOI URLs, prefer the europmc.org PMC ones over the DOI option. May want to turn this into a config or command-line option in the future. | |||||
* | remove 'oa_only' feature from ingest transform | Bryan Newbold | 2020-01-28 | 1 | -14/+1 | |
| | Refactoring to move this filter elsewhere | |||||
* | transform ingests via pmc/pmcid, not pubmed/pmid | Bryan Newbold | 2019-12-24 | 1 | -4/+4 | |
| | ||||||
* | update ingest request schema | Bryan Newbold | 2019-12-13 | 1 | -5/+22 | |
| | This is mostly changing ingest_type from 'file' to 'pdf', and adding 'link_source'/'link_source_id', plus some small cleanups. | |||||
* | tweaks to ingest-file transform | Bryan Newbold | 2019-12-12 | 1 | -13/+7 | |
| | ||||||
* | project -> ingest_request_source | Bryan Newbold | 2019-11-15 | 1 | -2/+2 | |
| | ||||||
* | fix release.pmcid typo | Bryan Newbold | 2019-11-15 | 1 | -2/+2 | |
| | ||||||
* | more ingest importer comments and counts | Bryan Newbold | 2019-11-15 | 1 | -1/+1 | |
| | ||||||
* | add ingest request transform (and test) | Bryan Newbold | 2019-11-15 | 2 | -0/+67 | |
| | ||||||
* | dict wrapper for entity_from_json() | Bryan Newbold | 2019-10-08 | 2 | -3/+7 | |
| | ||||||
* | refactor all python source for client lib name | Bryan Newbold | 2019-09-05 | 3 | -3/+3 | |
| | ||||||
* | comment clarifying container.ident in ES release transform | Bryan Newbold | 2019-09-03 | 1 | -0/+2 | |
| | ||||||
* | fix previous fix (need tests) | Bryan Newbold | 2019-09-03 | 1 | -2/+2 | |
| | ||||||
* | fix typo bug in container ES transform | Bryan Newbold | 2019-09-03 | 1 | -2/+2 | |
| | ||||||
* | use EZB and szczepanski as OA signals (ES) | Bryan Newbold | 2019-09-03 | 1 | -0/+12 | |
| | ||||||
* | elasticsearch transform: fix url.url bug | Bryan Newbold | 2019-05-24 | 1 | -11/+11 | |
| | ||||||
* | add 'superceded' release extra flag to elastic schema | Bryan Newbold | 2019-05-23 | 1 | -0/+1 | |
| | ||||||
* | also track work_id in release elasticsearch table | Bryan Newbold | 2019-05-22 | 1 | -0/+1 | |
| | ||||||
* | count linked refs (not just raw refs) in elasticsearch | Bryan Newbold | 2019-05-22 | 1 | -0/+3 | |
| | ||||||
* | include creator_ids in release elastic schema | Bryan Newbold | 2019-05-20 | 1 | -0/+6 | |
| | Intent is to allow fast creator search/lookup | |||||
* | elastic release schema update | Bryan Newbold | 2019-05-20 | 1 | -2/+5 | |
| | ||||||
* | improved CSL transform (structured author names) | Bryan Newbold | 2019-05-20 | 1 | -12/+11 | |
| | ||||||
* | make some XXX into TODO | Bryan Newbold | 2019-05-20 | 1 | -2/+2 | |
| | ||||||
* | fix elastic file pdf check | Bryan Newbold | 2019-05-16 | 1 | -1/+3 | |
| | ||||||
* | elastic transforms: work around missing pdf mimetypes | Bryan Newbold | 2019-05-15 | 1 | -1/+1 | |
| | ||||||
* | partial python impl of ext_id and release_stage refactors | Bryan Newbold | 2019-05-13 | 2 | -14/+15 | |
| | ||||||
* | handle null abstracts for release | Bryan Newbold | 2019-05-07 | 1 | -1/+1 | |
| | ||||||
* | improve test coverage | Bryan Newbold | 2019-04-04 | 1 | -0/+1 | |
| | ||||||
* | expose bibtex and citeproc; revert /unstable/ prefixes | Bryan Newbold | 2019-03-18 | 1 | -1/+1 | |
| | ||||||
* | refactor and test citeproc code | Bryan Newbold | 2019-03-18 | 2 | -3/+55 | |
| | ||||||
* | more integration of transform refactor | Bryan Newbold | 2019-03-11 | 1 | -2/+2 | |
| | ||||||
* | refactor transforms into sub-dir | Bryan Newbold | 2019-03-11 | 4 | -0/+532 | |