Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | datacite: resolve formatting issues in tests | Martin Czygan | 2020-07-10 | 96 | -340/+182 |
|\ | |||||
| * | Merge branch 'martin-arxiv-fix-http-503' into 'master' | bnewbold | 2020-07-10 | 1 | -1/+1 |
| |\ | | | | | | | | | | | | | arxiv: address 503, "Retry after specified interval" error See merge request webgroup/fatcat!64 | ||||
| | * | arxiv: retry five times on HTTP 503 | Martin Czygan | 2020-07-10 | 1 | -1/+1 |
| | | | |||||
| * | | get mediawiki username creation working with spaces | Bryan Newbold | 2020-07-09 | 1 | -1/+2 |
| | | | |||||
| * | | datacite: fix attribute error | Martin Czygan | 2020-07-07 | 1 | -1/+1 |
| |/ | | | | | | | refs: #44035 | ||||
| * | tweak flake8 params | Bryan Newbold | 2020-07-01 | 1 | -2/+8 |
| | | |||||
| * | lint (flake8) python test files | Bryan Newbold | 2020-07-01 | 45 | -168/+71 |
| | | |||||
| * | lint (flake8) tool python files | Bryan Newbold | 2020-07-01 | 33 | -130/+46 |
| | | |||||
| * | lint (flake8) web interface python files | Bryan Newbold | 2020-07-01 | 7 | -26/+16 |
| | | |||||
| * | lint (flake8) top-level python files | Bryan Newbold | 2020-07-01 | 8 | -25/+11 |
| | | |||||
| * | updates to Makefile | Bryan Newbold | 2020-07-01 | 2 | -5/+32 |
| | | |||||
| * | reviewer: fix bugs in common code found by mypy | Bryan Newbold | 2020-07-01 | 1 | -2/+3 |
| | | |||||
| * | update TODO with some old examples | Bryan Newbold | 2020-07-01 | 1 | -0/+10 |
| | | |||||
* | | datacite: adjust tests | Martin Czygan | 2020-07-10 | 4 | -10/+6 |
| | | |||||
* | | datacite: there should be no index gaps | Martin Czygan | 2020-07-10 | 1 | -2/+8 |
| | | |||||
* | | datacite: document contributor types | Martin Czygan | 2020-07-10 | 1 | -0/+25 |
| | | |||||
* | | wip: contrib, GH59 | Martin Czygan | 2020-07-10 | 2 | -245/+383 |
| | | |||||
* | | wip: contrib, GH59 | Martin Czygan | 2020-07-10 | 5 | -3/+105 |
| | | |||||
* | | datacite: address duplicated contributor issue | Martin Czygan | 2020-07-07 | 6 | -11/+110 |
|/ | | | | | | | Use string comparison. * https://fatcat.wiki/release/spjysmrnsrgyzgq6ise5o44rlu/contribs * https://api.datacite.org/dois/10.25940/roper-31098406 | ||||
* | add new license mappings | Bryan Newbold | 2020-06-30 | 2 | -0/+27 |
| | |||||
* | datacite: improve license mapping | Martin Czygan | 2020-06-30 | 2 | -9/+29 |
| | | | | via "missed potential license", refs #58 | ||||
* | datacite: hard cast possible date value to string | Martin Czygan | 2020-06-29 | 2 | -1/+2 |
| | |||||
* | disallow a specific unicode character from DOIs | Bryan Newbold | 2020-06-26 | 1 | -0/+6 |
| | |||||
* | make fulltext-only label clickable | Martin Czygan | 2020-06-16 | 1 | -2/+2 |
| | |||||
* | Merge branch 'bnewbold-better-button-links' into 'master' | Martin Czygan | 2020-06-05 | 4 | -4/+18 |
|\ | | | | | | | | | better download button links See merge request webgroup/fatcat!57 | ||||
| * | use ES 'best_url' in file download pages | Bryan Newbold | 2020-06-04 | 2 | -2/+4 |
| | | | | | | | | Similar to recent change for release download pages. | ||||
| * | ES schema: add best_url to file schema | Bryan Newbold | 2020-06-04 | 1 | -0/+12 |
| | | | | | | | | | | | | | | | | | | This will increase index size (URLs are often long in our corpus, and we have many file entities), but seems worth it. Initially added `ia_url` as a second field, guaranteed to always be an *.archive.org URL, but `best_url` defaults to that anyways so didn't seem worthwhile. | ||||
| * | re-use 'best pdf url' for release green button | Bryan Newbold | 2020-06-04 | 1 | -2/+2 |
| | | | | | | | | | | | | | | I thought this was the existing behavior, but it looks like we were just taking the first link from the first file. In the future may refactor this out even further. | ||||
* | | fix 'dev' target in python makefile | Bryan Newbold | 2020-06-04 | 1 | -1/+1 |
|/ | |||||
* | Merge remote-tracking branch 'origin/martin-harvest-fail-on-400' | Bryan Newbold | 2020-05-29 | 1 | -4/+0 |
|\ | | | | | | | | | | | Manually resolved conflicts: python/fatcat_tools/harvest/doi_registrars.py | ||||
| * | harvest: fail on HTTP 400 | Martin Czygan | 2020-05-29 | 1 | -4/+0 |
| | | | | | | | | | | | | | | | | | | In the past harvest of datacite resulted in occasional HTTP 400. Meanwhile, various API bugs have been fixed (most recently: https://github.com/datacite/lupo/pull/537, https://github.com/datacite/datacite/issues/1038). Downside of ignoring this error was that state lives in kafka, which has limited support for deletion of arbitrary messages from a topic. | ||||
* | | Merge branch 'martin-datacite-harvest-log-output' into 'master' | Martin Czygan | 2020-05-29 | 1 | -1/+1 |
|\ \ | | | | | | | | | | | | | harvest: log the failed url See merge request webgroup/fatcat!55 | ||||
| * | | harvest: log the failed url | Martin Czygan | 2020-05-29 | 1 | -1/+1 |
| |/ | |||||
* / | datacite: fix test docs | Martin Czygan | 2020-05-29 | 1 | -3/+3 |
|/ | |||||
* | Merge branch 'bnewbold-ingest-stage' into 'master' | Martin Czygan | 2020-05-28 | 3 | -7/+46 |
|\ | | | | | | | | | verify release_stage in ingest importer See merge request webgroup/fatcat!52 | ||||
| * | ingest importer: check that stage is consistent with release | Bryan Newbold | 2020-05-26 | 1 | -0/+5 |
| | | |||||
| * | regression test for release_stage mismatch with ingest request | Bryan Newbold | 2020-05-26 | 2 | -7/+41 |
| | | |||||
* | | rename HarvestState.next() to HarvestState.next_span() | Bryan Newbold | 2020-05-26 | 5 | -7/+7 |
|/ | | | | | | | | "span" short for "timespan" to harvest; there may be a better name to use. Motivation for this is to work around a pylint error that .next() was not callable. This might be a bug with pylint, but .next() is also a very generic name. | ||||
* | add a work-in-progress python Makefile | Bryan Newbold | 2020-05-26 | 1 | -0/+24 |
| | |||||
* | pylintrc: skip many spurious WTForm no-member errors | Bryan Newbold | 2020-05-26 | 1 | -0/+2 |
| | |||||
* | HACK: try to squelch pylint in CI | Bryan Newbold | 2020-05-26 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | Gitlab CI is showing lint errors like: =================================== FAILURES =================================== _______________________ [pylint] tests/harvest_state.py ________________________ E: 19,11: hs.next is not callable (not-callable) E: 33,11: hs.next is not callable (not-callable) E: 19,11: hs.next is not callable (not-callable) [...] This is confusing as we use pipenv with a lock, so I should see the exact same errors locally. This commit is a hack to try and fix this and unbreak builds until we can debug further. | ||||
* | HACK: skip pylint errors on lines that seem to be fine | Bryan Newbold | 2020-05-22 | 3 | -3/+3 |
| | | | | | It seems to be an inadvertently upgraded version of pylint saying that these lines are not-callable. | ||||
* | pipenv: add flake8 | Bryan Newbold | 2020-05-22 | 2 | -183/+213 |
| | |||||
* | Merge remote-tracking branch 'github/master' | Bryan Newbold | 2020-05-22 | 2 | -7/+7 |
|\ | |||||
| * | Identity is not the same thing as equality in Python | Christian Clauss | 2020-05-14 | 1 | -2/+2 |
| | | |||||
| * | Identity is not the same thing as equality in Python | Christian Clauss | 2020-05-14 | 1 | -5/+5 |
| | | |||||
* | | importers: clarify handling of ApiException | Bryan Newbold | 2020-05-22 | 3 | -4/+10 |
| | | | | | | | | | | | | | | | | One of these (in ingest importer pipeline) is an actual bug, the others are just changing the syntax to be more explicit/conservative. The ingest importer bug seems to have resulted in some bad file match imports; scale of impact is unknown. | ||||
* | | ingest importer: don't use glutton matches | Bryan Newbold | 2020-05-22 | 1 | -3/+3 |
| | | | | | | | | | | | | | | Until reviewing I didn't realize we were even doing this currently. Hopefully this has not impacted too many imports: almost all ingests use an external identifier, so only those with identifiers not in fatcat for whatever reason would be affected. | ||||
* | | Merge branch 'martin-fix-container-empty-search' into 'master' | Martin Czygan | 2020-04-29 | 1 | -0/+4 |
|\ \ | | | | | | | | | | | | | search: assume * when q is not set or empty See merge request webgroup/fatcat!51 | ||||
| * | | search: assume * when q is not set or empty | Martin Czygan | 2020-04-29 | 1 | -0/+4 |
| | | | | | | | | | | | | An example would be a blank search from a container details page. |
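
The last entry above ("search: assume * when q is not set or empty") describes a fallback to a match-all query. A minimal sketch of that idea follows; the helper name `normalize_query` is hypothetical, treating whitespace-only input as empty is an assumption, and the actual four-line fatcat change is not shown in this log.

```python
from typing import Optional


def normalize_query(q: Optional[str]) -> str:
    """Hypothetical helper, not the fatcat implementation.

    Returns the match-all query "*" when q is missing or blank,
    otherwise returns q unchanged.
    """
    # Assumption: a whitespace-only query counts as empty.
    if q is None or not q.strip():
        return "*"
    return q
```

A blank search from a container details page would then behave like `normalize_query("")`, which falls back to `"*"`.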