|  | Commit message (Collapse) | Author | Age | Files | Lines | 
|---|
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | Frequently when looking at preservation coverage of journals, the
current year shows as "un-preserved" when in fact there is robust KBART
(keepers, eg CLOCKSS/Portico) coverage. This is partially because we
don't update containers with KBART year spans very frequently (which is
on us), and partially because KBART reports are often a bit out of day
(eg, doesn't show coverage for the current year. For that matter, they
probably take a few months to update the previous year as well, but that
is a larger time span to fudge over.
This patch means we will count Portico/LOCKSS/etc coverage for "last
year" to count as coverage of publications dated "this year". Note that
for this to be effective/correct, it is assumed that we will update
containers with coverage year spans at least once a year, and that we
will re-index all releases at least once a year. | 
| | |  | 
| | |  | 
| |\  
| | 
| | 
| | 
| | | datacite: address duplicated contributor issue
See merge request webgroup/fatcat!65 | 
| | |\ |  | 
| | | | |  | 
| | | | |  | 
| | | | |  | 
| | | | |  | 
| | | | |  | 
| | | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | | Use string comparison.
* https://fatcat.wiki/release/spjysmrnsrgyzgq6ise5o44rlu/contribs
* https://api.datacite.org/dois/10.25940/roper-31098406 | 
| |\ \ \  
| |_|/  
|/| |   
| | |   
| | | | datacite: mitigate sentry #44035
See merge request webgroup/fatcat!66 | 
| | | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | 
| | | | According to sentry, running `c.get('nameIdentifiers', []) or []` on a c with value:
```
{'affiliation': [],
 'familyName': 'Guidon',
 'givenName': 'Manuel',
 'nameIdentifiers': {'nameIdentifier': 'https://orcid.org/0000-0003-3543-6683',
                     'nameIdentifierScheme': 'ORCID',
                     'schemeUri': 'https://orcid.org'},
 'nameType': 'Personal'}
```
results in a string, which I cannot reproduce. The document in question at:
https://api.datacite.org/dois/10.26275/kuw1-fdls seems fine, too. | 
| |\ \ \  
| | | | 
| | | | 
| | | | 
| | | | | arxiv: address 503, "Retry after specified interval" error
See merge request webgroup/fatcat!64 | 
| | | | | |  | 
| | | | | |  | 
| |\ \ \ \  
| |/ / /  
|/| / /   
| |/ /    
| | | | datacite: fix attribute error
See merge request webgroup/fatcat!63 | 
| |/ /  
| |   
| |   
| | | refs: #44035 | 
| |\ \  
| | | 
| | | 
| | | 
| | | | lint cleanups
See merge request webgroup/fatcat!62 | 
| | | | |  | 
| | | | |  | 
| | | | |  | 
| | | | |  | 
| |/ / |  | 
| | | |  | 
| | | |  | 
| | | |  | 
| | | |  | 
| | | |  | 
| | | |  | 
| |/ |  | 
| | |  | 
| | |  | 
| | 
| 
| 
| | via "missed potential license", refs #58 | 
| |\  
| | 
| | 
| | 
| | | datacite: hard cast possible date value to string
See merge request webgroup/fatcat!59 | 
| |/ |  | 
| | |  | 
| | |  | 
| |\  
| | 
| | 
| | 
| | | make fulltext-only label clickable
See merge request webgroup/fatcat!58 | 
| |/ |  | 
| |\  
| | 
| | 
| | 
| | | better download button links
See merge request webgroup/fatcat!57 | 
| | | 
| | 
| | 
| | | Similar to recent change for release download pages. | 
| | | 
| | 
| | 
| | 
| | 
| | 
| | 
| | 
| | | This will increase index size (URLs are often long in our corpus, and we
have many file entities), but seems worth it.
Initially added `ia_url` as a second field, guaranteed to always be an
*.archive.org URL, but `best_url` defaults to that anyways so didn't
seem worthwhile. | 
| | | 
| | 
| | 
| | 
| | 
| | 
| | | I thought this was the existing behavior, but it looks like we were just
taking the first link from the first file.
In the future may refactor this out even further. | 
| |/ |  | 
| |\  
| | 
| | 
| | 
| | 
| | | Manually resolved conflicts:
    python/fatcat_tools/harvest/doi_registrars.py | 
| | | 
| | 
| | 
| | 
| | 
| | 
| | 
| | 
| | | In the past harvest of datacite resulted in occasional HTTP 400.
Meanwhile, various API bugs have been fixed (most recently:
https://github.com/datacite/lupo/pull/537,
https://github.com/datacite/datacite/issues/1038). Downside of ignoring
this error was that state lives in kafka, which has limited support for
deletion of arbitrary messages from a topic. | 
| |\ \  
| | | 
| | | 
| | | 
| | | | harvest: log the failed url
See merge request webgroup/fatcat!55 | 
| | |/ |  | 
| |\ \  
| |/  
|/|   
| |   
| | | datacite: fix test docs
See merge request webgroup/fatcat!54 |