| Commit message (Expand) | Author | Age | Files | Lines |
... | |
* | DOAJ import notes | Bryan Newbold | 2020-12-17 | 2 | -2/+23 |
* | add dblp as an ingest source and identifier | Bryan Newbold | 2020-12-17 | 1 | -1/+2 |
* | ingest: allow doaj ingest responses | Bryan Newbold | 2020-12-17 | 1 | -1/+2 |
* | bug fix: is_preserved should always be bool | Bryan Newbold | 2020-12-17 | 1 | -2/+2 |
* | Merge branch 'bnewbold-doaj-fuzzy' into 'master' | bnewbold | 2020-12-18 | 7 | -267/+544 |
|\ |
|
| * | update fuzzy helper to pass 'reason' through to import code | Bryan Newbold | 2020-12-17 | 2 | -5/+5 |
| * | pipenv: bump fuzzycat to 0.1.9 | Bryan Newbold | 2020-12-17 | 2 | -5/+5 |
| * | add fuzzy match filtering to DOAJ importer | Bryan Newbold | 2020-12-16 | 2 | -4/+23 |
| * | add fuzzy matching helper to importer base class | Bryan Newbold | 2020-12-16 | 3 | -2/+147 |
| * | pipenv: add fuzzycat dependency | Bryan Newbold | 2020-12-16 | 2 | -261/+374 |
* | | Merge pull request #65 from ibnesayeed/patch-1 | bnewbold | 2020-12-17 | 1 | -1/+1 |
|\ \ |
|
| * | | Improve status counting efficiency | Sawood Alam | 2020-12-17 | 1 | -1/+1 |
* | | | Merge branch 'bnewbold-es-transform-html' into 'master' | Martin Czygan | 2020-12-17 | 5 | -146/+296 |
|\ \ \
| |_|/
|/| | |
|
| * | | entity update worker: treat fileset and webcapture updates like file updates | Bryan Newbold | 2020-12-16 | 1 | -3/+25 |
| * | | fix indentation | Bryan Newbold | 2020-12-16 | 1 | -2/+2 |
| * | | have release elasticsearch transform count webcaptures and filesets towards p... | Bryan Newbold | 2020-12-16 | 1 | -26/+57 |
| * | | improve release elasticsearch transform test coverage | Bryan Newbold | 2020-12-16 | 3 | -11/+86 |
| * | | small release_to_elasticsearch refactors | Bryan Newbold | 2020-12-16 | 1 | -7/+12 |
| * | | refactor release_to_elasticsearch transform | Bryan Newbold | 2020-12-16 | 1 | -131/+148 |
|/ / |
|
* | | html ingest: small fixes to try_update() code path | Bryan Newbold | 2020-12-15 | 1 | -5/+5 |
* | | notes on partial-progress DOAJ release metadata import | Bryan Newbold | 2020-12-14 | 1 | -0/+105 |
* | | bulk import notes on ORCID | Bryan Newbold | 2020-12-14 | 1 | -0/+55 |
* | | Revert "gitlab CI: explicitly use xenial tag of image" | Bryan Newbold | 2020-12-11 | 1 | -1/+1 |
* | | Revert "docker xenial base image: include python3.8" | Bryan Newbold | 2020-12-11 | 1 | -6/+1 |
* | | gitlab CI: explicitly use xenial tag of image | Bryan Newbold | 2020-12-11 | 1 | -1/+1 |
* | | docker xenial base image: include python3.8 | Bryan Newbold | 2020-12-11 | 1 | -1/+6 |
* | | HACK: squash intermitent failure of detect_text_lang() test | Bryan Newbold | 2020-12-11 | 1 | -1/+2 |
* | | guide: small updates to container extra schema notes (from dblp work) | Bryan Newbold | 2020-12-11 | 1 | -2/+7 |
* | | bulk edits: note ORCID update | Bryan Newbold | 2020-12-11 | 1 | -1/+5 |
* | | docker: how to push to dockerhub | Bryan Newbold | 2020-12-11 | 1 | -0/+4 |
* | | Merge branch 'bnewbold-doaj-metadata' into 'master' | Martin Czygan | 2020-11-24 | 37 | -1549/+2845 |
|\ \ |
|
| * | | cargo: update sentry to fix memory initialization issue | Bryan Newbold | 2020-11-20 | 2 | -274/+332 |
| * | | DOAJ: remove accidentally commited 'skip' of a test | Bryan Newbold | 2020-11-20 | 1 | -1/+0 |
| * | | langdetect: more text for 'zh' test case | Bryan Newbold | 2020-11-20 | 1 | -1/+1 |
| * | | DOAJ: update importer README with example invocation | Bryan Newbold | 2020-11-20 | 1 | -0/+7 |
| * | | crossref+datacite: remove confusing early update bail | Bryan Newbold | 2020-11-20 | 2 | -4/+0 |
| * | | doaj: fix update code path (getattr not __dict__) | Bryan Newbold | 2020-11-20 | 3 | -15/+70 |
| * | | DOAJ: handle empty identifier 'id' case | Bryan Newbold | 2020-11-20 | 1 | -0/+2 |
| * | | clean DOI: ban all non-ASCII characters | Bryan Newbold | 2020-11-19 | 1 | -1/+4 |
| * | | normal: handle langdetect of 'zh-cn' (not len=2) | Bryan Newbold | 2020-11-19 | 1 | -0/+3 |
| * | | update fatcatd rust code for 'oai' external identifier | Bryan Newbold | 2020-11-19 | 4 | -11/+189 |
| * | | codegen rust schema crate | Bryan Newbold | 2020-11-19 | 6 | -3/+20 |
| * | | codegen python openapi client | Bryan Newbold | 2020-11-19 | 2 | -4/+36 |
| * | | schema: also add 'oai' identifer (OAI-PMH) for releases | Bryan Newbold | 2020-11-19 | 2 | -2/+9 |
| * | | tweak DOAJ importer class args and default for do_updates | Bryan Newbold | 2020-11-19 | 1 | -2/+2 |
| * | | show DOAJ (and dblp) identifiers in release view | Bryan Newbold | 2020-11-19 | 1 | -1/+7 |
| * | | if a release has DOAJ article id, count as OA | Bryan Newbold | 2020-11-19 | 1 | -0/+3 |
| * | | implement remainder of DOAJ article importer | Bryan Newbold | 2020-11-19 | 3 | -68/+168 |
| * | | handle more non-ASCII DOI cases | Bryan Newbold | 2020-11-19 | 1 | -1/+3 |
| * | | more python normalizers, and move from importer common | Bryan Newbold | 2020-11-19 | 2 | -154/+326 |