Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | stats.json update after releases v03 cut-over | Bryan Newbold | 2019-06-06 | 1 | -0/+1 |
| | |||||
* | elasticsearch index alias howto | Bryan Newbold | 2019-06-06 | 1 | -1/+16 |
| | |||||
* | update TODO | Bryan Newbold | 2019-06-06 | 1 | -0/+13 |
| | |||||
* | add changelog link to header | Bryan Newbold | 2019-06-05 | 1 | -0/+1 |
| | |||||
* | fixes to lookup views | Bryan Newbold | 2019-06-05 | 4 | -11/+22 |
| | |||||
* | update tests for lookup views | Bryan Newbold | 2019-06-05 | 1 | -3/+3 |
| | |||||
* | more lookup views | Bryan Newbold | 2019-06-05 | 7 | -37/+279 |
| | |||||
* | release lookup view | Bryan Newbold | 2019-06-05 | 4 | -10/+162 |
| | |||||
* | QA checks (for hash, extid duplication) | Bryan Newbold | 2019-06-04 | 4 | -0/+82 |
| | |||||
* | recent prod table sizes; 380 GBytes or so total | Bryan Newbold | 2019-06-04 | 1 | -0/+233 |
| | |||||
* | dump_release_extid.sql changes for new schema | Bryan Newbold | 2019-06-03 | 1 | -1/+1 |
| | |||||
* | move export README info to sql_dumps doc | Bryan Newbold | 2019-06-03 | 4 | -21/+30 |
| | |||||
* | more pubmed importer fixes | Bryan Newbold | 2019-06-03 | 1 | -6/+13 |
| | |||||
* | recent bootstrap/import notes | Bryan Newbold | 2019-06-03 | 3 | -0/+495 |
| | |||||
* | JALC: handle empty publisher string | Bryan Newbold | 2019-05-30 | 1 | -3/+4 |
| | |||||
* | update journal_metadata import command | Bryan Newbold | 2019-05-30 | 1 | -3/+3 |
| | |||||
* | fix parse_merge_metadata.py merge_spans() | Bryan Newbold | 2019-05-30 | 1 | -4/+8 |
| | |||||
* | stabilize sort order for entity metadata | Bryan Newbold | 2019-05-30 | 1 | -2/+2 |
| | |||||
* | journal_metadata import: allow KBART updates | Bryan Newbold | 2019-05-30 | 1 | -5/+16 |
| | |||||
* | better KBART merging | Bryan Newbold | 2019-05-30 | 1 | -4/+5 |
| | |||||
* | initial code to handle multiple KBART spans better | Bryan Newbold | 2019-05-30 | 1 | -2/+64 |
| | |||||
* | fix and workaround container entities in release topic | Bryan Newbold | 2019-05-30 | 2 | -2/+8 |
| | |||||
* | fix syntax bugs (container elastic worker) | Bryan Newbold | 2019-05-30 | 2 | -6/+6 |
| | |||||
* | add work-in-progress elastic index notes | Bryan Newbold | 2019-05-30 | 1 | -0/+11 |
| | |||||
* | add container update elastic worker | Bryan Newbold | 2019-05-30 | 3 | -6/+42 |
| | |||||
* | file and container update kafka topics | Bryan Newbold | 2019-05-30 | 2 | -56/+77 |
| | |||||
* | update elastic for releases when files added | Bryan Newbold | 2019-05-30 | 1 | -1/+36 |
| | | | | A bunch of remaining TODOs here | ||||
* | make importer extid lookups faster by hiding | Bryan Newbold | 2019-05-29 | 1 | -2/+2 |
| | |||||
* | fix 'fd' XML pattern for bulk imports | Bryan Newbold | 2019-05-29 | 1 | -3/+2 |
| | |||||
* | yet another pubmed weird DOI corner case | Bryan Newbold | 2019-05-29 | 1 | -1/+1 |
| | |||||
* | handle pubmed CollectiveName null-ness | Bryan Newbold | 2019-05-29 | 1 | -1/+1 |
| | |||||
* | tweak JALC tests for english swaperoo | Bryan Newbold | 2019-05-29 | 1 | -2/+2 |
| | |||||
* | remove stray JALC debug code | Bryan Newbold | 2019-05-29 | 1 | -2/+3 |
| | |||||
* | is_cjk() handles kanji better | Bryan Newbold | 2019-05-29 | 1 | -4/+6 |
| | |||||
* | improve JALC author handling | Bryan Newbold | 2019-05-29 | 1 | -59/+85 |
| | |||||
* | handle empty retraction_of.PMID in pubmed importer | Bryan Newbold | 2019-05-29 | 1 | -2/+4 |
| | |||||
* | faster LargeFile XML importer for PubMed | Bryan Newbold | 2019-05-29 | 4 | -5/+55 |
| | |||||
* | elasticsearch transform: fix url.url bug | Bryan Newbold | 2019-05-24 | 1 | -11/+11 |
| | |||||
* | more MARC languages, and less verbose reporting | Bryan Newbold | 2019-05-24 | 2 | -4/+15 |
| | |||||
* | better crossref container_name handling | Bryan Newbold | 2019-05-24 | 1 | -7/+12 |
| | |||||
* | push elastic release updates to new (schema) index | Bryan Newbold | 2019-05-24 | 1 | -1/+1 |
| | |||||
* | Merge branch 'bnewbold-import-tweaks' | Bryan Newbold | 2019-05-23 | 5 | -7/+13 |
|\ | |||||
| * | set superceded flag on 'old' arxiv releases | Bryan Newbold | 2019-05-23 | 2 | -1/+7 |
| | | |||||
| * | add 'superceded' release extra flag to elastic schema | Bryan Newbold | 2019-05-23 | 3 | -6/+6 |
| | | |||||
* | | new kafka topic for release obj w/ v0.3 schema | Bryan Newbold | 2019-05-23 | 1 | -2/+2 |
| | | |||||
* | | migration notes | Bryan Newbold | 2019-05-23 | 2 | -0/+44 |
|/ | |||||
* | update CHANGELOG for v0.3 (tag: v0.3.0) | Bryan Newbold | 2019-05-23 | 1 | -1/+8 |
| | |||||
* | minor cleanups to SQL migration | Bryan Newbold | 2019-05-23 | 1 | -11/+1 |
| | |||||
* | updated import instructions | Bryan Newbold | 2019-05-23 | 1 | -0/+42 |
| | |||||
* | pubmed DOIs need strip() | Bryan Newbold | 2019-05-22 | 1 | -1/+1 |
| |