Commit message | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
| * | datacite: add missing mappings and notes | Martin Czygan | 2019-12-28 | 1 | -266/+175 | |
| | | ||||||
| * | datacite: basic field mappings | Martin Czygan | 2019-12-28 | 1 | -41/+181 | |
| | | Currently using two external libraries: dateparser and langcodes. Note: this commit includes lots of WIP docs and field stats in comments, which should be removed. | |||||
| * | datacite: importer skeleton | Martin Czygan | 2019-12-28 | 4 | -0/+514 | |
| | | contributors, title, date, publisher, container, license. Field and value analysis via https://github.com/miku/indigo. | |||||
* | | 2019-01-07 status update | Bryan Newbold | 2020-01-07 | 2 | -0/+36 | |
| | | ||||||
* | | chocula bulk edit note | Bryan Newbold | 2020-01-07 | 2 | -0/+15 | |
| | | ||||||
* | | importers: control update behavior with more-standard flag | Bryan Newbold | 2020-01-06 | 6 | -3/+15 | |
| | | ||||||
* | | proposals: standardize a bit | Bryan Newbold | 2020-01-03 | 9 | -3/+34 | |
| | | ||||||
* | | notes on search query parsing (WIP) | Bryan Newbold | 2020-01-03 | 1 | -0/+22 | |
| | | ||||||
* | | fatcat identifiers proposal (WIP) | Bryan Newbold | 2020-01-03 | 1 | -0/+25 | |
| | | ||||||
* | | proposal: python3.7 upgrade | Bryan Newbold | 2020-01-03 | 1 | -0/+101 | |
| | | ||||||
* | | pipenv: update pytest to 5.x; remove langcodes | Bryan Newbold | 2020-01-03 | 2 | -108/+85 | |
| | | pytest had been pinned to the 4.x series to work around a test import package mangling problem with citeproc_styles. Now that pytest.ini explicitly lists test files, this no longer seems to be a problem, and pytest can be updated to the most recent version. Also re-locked Pipfile.lock with updated dependencies (only minor changes). | |||||
* | | pytest: explicitly indicate all in-scope test files | Bryan Newbold | 2020-01-03 | 1 | -3/+1 | |
| | | The purpose of this change is to avoid test errors when pytest tries to recursively update assertion statements in all dependent packages. pytest does this to add pretty printing, which is nice, but probably shouldn't be done in all dependency libraries. This fixes test problems with both CSL (citeproc_styles) and dateparser (when actually imported in code, which currently does not happen on master). | |||||
* | | scholix schema links/proposal | Bryan Newbold | 2020-01-03 | 1 | -0/+3 | |
| | | ||||||
* | | update bulk edit CHANGELOG and orcid notes | Bryan Newbold | 2019-12-31 | 2 | -13/+49 | |
| | | ||||||
* | | Merge branch 'martin-guide-entity-release-fix' into 'master' | bnewbold | 2019-12-31 | 1 | -5/+5 | |
|\ \ | |/ |/| | remove duplicate fields in entity release. See merge request webgroup/fatcat!11 | |||||
| * | document year and date of withdrawn release | Martin Czygan | 2019-12-17 | 1 | -1/+5 | |
| | | ||||||
| * | remove duplicate fields in entity release | Martin Czygan | 2019-12-17 | 1 | -4/+0 | |
| | | ||||||
* | | bulk edit updates | Bryan Newbold | 2019-12-26 | 1 | -3/+4 | |
| | | ||||||
* | | orcid: skip non-person ORCID records | Bryan Newbold | 2019-12-26 | 1 | -0/+4 | |
| | | ||||||
* | | Merge branch 'martin-datacite-daily-harvest' into 'master' | Martin Czygan | 2019-12-26 | 3 | -5/+73 | |
|\ \ | Datacite daily harvest. See merge request webgroup/fatcat!6 | |||||
| * | | datacite: fix harvest test | Martin Czygan | 2019-12-27 | 1 | -1/+1 | |
| | | | The number of produced messages should match: jq '.data|length' tests/files/datacite_api.json | |||||
| * | | datacite: add simple test and fixture for datacite api interaction | Martin Czygan | 2019-12-27 | 2 | -0/+46 | |
| | | | ||||||
| * | | datacite: extend range search query | Martin Czygan | 2019-12-27 | 1 | -1/+1 | |
| | | | The bracket syntax is inclusive. See also: https://www.elastic.co/guide/en/elasticsearch/reference/7.5/query-dsl-query-string-query.html#_ranges | |||||
| * | | avoid usage of short links | Martin Czygan | 2019-12-27 | 1 | -2/+2 | |
| | | | ||||||
| * | | Datacite API v2 throws 400, we cannot recover from, currently. | Martin Czygan | 2019-12-27 | 1 | -0/+4 | |
| | | | As a first iteration, just mark the daily batch complete and continue. The occasional HTTP 400 issue has been reported as https://github.com/datacite/datacite/issues/897. A possible improvement would be to shrink the window, so losses will be smaller. | |||||
| * | | datacite: update documentation, add links to issues | Martin Czygan | 2019-12-27 | 1 | -10/+5 | |
| | | | ||||||
| * | | datacite: use v2 of the API (flaky) | Martin Czygan | 2019-12-27 | 1 | -5/+28 | |
|/ / | Parameter update for datacite API v2. Works fine, but there are occasional HTTP 400 responses when using the cursor API (daily updates can exceed the 10000 record limit for search queries). The HTTP 400 issue is not solved yet, but has been reported to datacite as https://github.com/datacite/datacite/issues/897. | |||||
* | | transform ingests via pmc/pmcid, not pubmed/pmid | Bryan Newbold | 2019-12-24 | 1 | -4/+4 | |
| | | ||||||
* | | allow arabesque backfill ingests for some source types | Bryan Newbold | 2019-12-24 | 1 | -0/+5 | |
| | | ||||||
* | | make chocula URL updates more conservative | Bryan Newbold | 2019-12-24 | 1 | -5/+5 | |
| | | ||||||
* | | pubmed: if doing update, also do subtitle schema update | Bryan Newbold | 2019-12-23 | 1 | -1/+9 | |
| | | ||||||
* | | doi parsing fixes | Bryan Newbold | 2019-12-23 | 1 | -0/+7 | |
| | | Replace emdash with regular dash. Replace double slash after partner ID with single slash. This conversion seems to be done by Crossref automatically on lookup. I tried several examples, using the doi.org resolver and Crossref API lookup. Note that there are a number of fatcat entities with '//' in the DOI. | |||||
* | | pubmed bulk import notes (from QA) | Bryan Newbold | 2019-12-23 | 1 | -0/+45 | |
| | | ||||||
* | | pubmed: improve warning and stderr formatting | Bryan Newbold | 2019-12-23 | 1 | -5/+6 | |
| | | ||||||
* | | pubmed: use standard identifier cleaners | Bryan Newbold | 2019-12-23 | 1 | -17/+14 | |
| | | ||||||
* | | pubmed: remove unused extid mapping code | Bryan Newbold | 2019-12-23 | 1 | -29/+0 | |
| | | ||||||
* | | pubmed: do reference lookups by default | Bryan Newbold | 2019-12-23 | 1 | -1/+1 | |
| | | ||||||
* | | normalizers: clean_pmid(), and handle nulls in all other cleaners | Bryan Newbold | 2019-12-23 | 1 | -0/+31 | |
| | | ||||||
* | | pubmed: null doi parsing check | Bryan Newbold | 2019-12-23 | 1 | -1/+1 | |
| | | ||||||
* | | add basic MedlineDate year parsing | Bryan Newbold | 2019-12-23 | 1 | -0/+11 | |
| | | ||||||
* | | add regression test for medlinedate -> year parsing | Bryan Newbold | 2019-12-23 | 2 | -0/+102 | |
| | | ||||||
* | | arxiv bulk update notes | Bryan Newbold | 2019-12-22 | 2 | -2/+49 | |
| | | ||||||
* | | fix spn/ingest importer duplication check | Bryan Newbold | 2019-12-22 | 1 | -6/+8 | |
| | | Check was happening after the `return True` by mistake, allowing duplicates in SPN editgroups, and potentially in ingest request editgroups as well. | |||||
* | | DB loads take a long time now | Bryan Newbold | 2019-12-21 | 1 | -1/+1 | |
| | | ||||||
* | | Merge branch 'bnewbold-datacite-ui' into 'master' | bnewbold | 2019-12-21 | 2 | -9/+13 | |
|\ \ | datacite release links and metadata expansion. See merge request webgroup/fatcat!15 | |||||
| * | | datacite release links and metadata expansion | Bryan Newbold | 2019-12-20 | 2 | -9/+13 | |
|/ / | Small ergonomic changes for datacite releases: add a link to live/current datacite metadata (like we do for Crossref); expand "extra" metadata fields under the 'datacite' dict in the metadata view. | |||||
* | | add 2019-12-20 stats | Bryan Newbold | 2019-12-20 | 2 | -0/+148 | |
| | | ||||||
* | | spn: include link_source/link_source_id in ingest request | Bryan Newbold | 2019-12-20 | 1 | -0/+2 | |
| | | ||||||
* | | Merge branch 'bnewbold-pipenv-20191217' into 'master' | bnewbold | 2019-12-18 | 2 | -286/+413 | |
|\ \ | pipenv updates. See merge request webgroup/fatcat!13 | |||||
| * | | pipenv: update deps | Bryan Newbold | 2019-12-17 | 2 | -11/+55 | |
| | | | loginpass patches got accepted upstream a while back, so we don't need to pin to a git version. ipython 7.10 seems to have problems installing, so restricting to earlier 6.x versions. |