Commit message    Author    Age    Files    Lines
* Improve status counting efficiency    Sawood Alam    2020-12-17    1    -1/+1
|     When the input is large but has only a small number of unique items to count, counting as we go is linear and more efficient than sorting and then counting unique items.
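The trade-off described in this commit message can be illustrated with a minimal sketch. It is not the code from the commit: the function names and the sample status values are hypothetical, and Python is used only for illustration.

```python
from collections import Counter
from itertools import groupby


def count_streaming(items):
    """Count in a single pass; memory grows with the number of unique items."""
    counts = Counter()
    for item in items:
        counts[item] += 1
    return dict(counts)


def count_by_sorting(items):
    """Sort first, then count runs of equal items (the sort-then-unique approach)."""
    return {key: sum(1 for _ in group) for key, group in groupby(sorted(items))}


# Hypothetical sample input: many records, few distinct statuses.
statuses = ["200", "404", "200", "301", "200", "404"]
assert count_streaming(statuses) == count_by_sorting(statuses)
```

Both functions return the same counts. The single-pass version does O(n) work, while the sorting version pays O(n log n) for the sort before it can count anything.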
* normalizer: filter out a specific non-ASCII character in DOI    Bryan Newbold    2020-11-04    1    -1/+3
|
* entity updates: don't ingest JSTOR DOI prefixes    Bryan Newbold    2020-10-23    1    -0/+2
|
* Merge branch 'bnewbold-scholar-pipeline' into 'master'    bnewbold    2020-10-20    2    -2/+26
|\
| |     entity updater: new work update feed (ident and changelog metadata only)
| |     See merge request webgroup/fatcat!87
| * entity updater: new work update feed (ident and changelog metadata only)    Bryan Newbold    2020-10-16    2    -2/+26
|/
* bulk citation graph workflow proposal    Bryan Newbold    2020-10-15    1    -0/+160
|
* Merge branch 'bnewbold-web-tweaks-20201013' into 'master'    Martin Czygan    2020-10-14    4    -12/+19
|\
| |     web coverage tweaks 20201013
| |     See merge request webgroup/fatcat!86
| * container coverage: add keeper link and KBART holdings list    Bryan Newbold    2020-10-13    1    -0/+11
| |
| * release view: remove ambiguous OA status indicator    Bryan Newbold    2020-10-13    1    -4/+0
| |
| * container view: fix non-OA empty box    Bryan Newbold    2020-10-13    1    -3/+3
| |
| * coverage: show counts and fraction in tooltip of coverage bars    Bryan Newbold    2020-10-13    1    -5/+5
|/
* update database/table stats    Bryan Newbold    2020-10-12    2    -0/+48
|
* chocula importer: small tweaks to update behavior    Bryan Newbold    2020-10-08    1    -8/+6
|
* elastic transform: more preservation keepers    Bryan Newbold    2020-10-08    1    -1/+2
|
* update 'contributing' page in guide    Bryan Newbold    2020-10-02    5    -17/+176
|
* update README    Bryan Newbold    2020-10-01    1    -41/+51
|
* updates to CHANGELOG    Bryan Newbold    2020-10-01    1    -0/+19
|
* more metadata cleanup task notes    Bryan Newbold    2020-10-01    1    -0/+7
|
* Merge branch 'bnewbold-202009-polish' into 'master'    Martin Czygan    2020-09-29    10    -124/+159
|\
| |     fatcat.wiki 2020-09 polish
| |     See merge request webgroup/fatcat!84
| * coverage: handle the case of hits, but none with years    Bryan Newbold    2020-09-17    1    -4/+5
| |
| * web: handle unknown CSL style as a cleaner 400 page    Bryan Newbold    2020-09-17    2    -1/+7
| |
| * web: update sub-resource integrity and pre-loading    Bryan Newbold    2020-09-17    1    -0/+13
| |     For security/integrity and performance
| * lint cleanups    Bryan Newbold    2020-09-17    2    -3/+0
| |
| * web: route constraints on fcids and UUIDs    Bryan Newbold    2020-09-17    2    -101/+103
| |     Instead of accepting any string for these parameters and throwing a 400 error if not the correct type, implement better route matching at the framework level and return more 404s. This resolves several outstanding sentry exceptions.
| |     The "flask-uuid" was imported and seems to have been configured for this purpose previously, but I guess I never finished configuring it.
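The route-matching idea in this commit message can be sketched without any framework. This is a hedged illustration using only the standard library, not the Flask or flask-uuid code from the commit. The route paths, handler names, and the assumed fcid shape (26 lowercase base32 characters, like the release identifier quoted elsewhere in this log) are all assumptions.

```python
import re

# Assumption: an fcid is 26 lowercase base32 characters (a-z, 2-7).
FCID = r"[a-z2-7]{26}"
# Canonical lowercase hyphenated UUID form.
UUID = r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}"

# Hypothetical routes: (compiled path pattern, handler name).
ROUTES = [
    (re.compile(rf"/release/(?P<ident>{FCID})"), "release_view"),
    (re.compile(rf"/some-uuid-resource/(?P<ident>{UUID})"), "uuid_view"),
]


def dispatch(path):
    """Return (status, handler, ident); paths with malformed idents never match."""
    for pattern, handler in ROUTES:
        match = pattern.fullmatch(path)
        if match:
            return 200, handler, match.group("ident")
    # No route matched: the request is a 404, and no handler has to raise a 400.
    return 404, None, None
```

With the constraint living in the route pattern, a request such as `/release/not-an-ident` is rejected before any view code runs, which is the behaviour the commit message describes as returning more 404s.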
| * container view: only show OA indicator when known    Bryan Newbold    2020-09-17    1    -5/+1
| |     The "is_oa:False" could be that we just don't know; we aren't actually distinguishing between false and blank.
| * web container view: hide preservation when no releases    Bryan Newbold    2020-09-17    1    -8/+6
| |
| * web toml editing: remove sub-entities from TOML    Bryan Newbold    2020-09-17    1    -0/+4
| |
| * coverage search: pretty display for ES query errors    Bryan Newbold    2020-09-17    2    -1/+19
| |
| * coverage: clarify available/accessible terminology    Bryan Newbold    2020-09-17    1    -1/+1
| |
* | update keepers links to keepers.issn.org    Bryan Newbold    2020-09-28    2    -8/+8
| |
* | Merge branch 'martin-datacite-spammy-title' into 'master'    Martin Czygan    2020-09-22    2    -0/+25
|\ \
| |/
|/|
| |     address spammy datacite titles
| |     See merge request webgroup/fatcat!85
| * address spammy datacite titles    Martin Czygan    2020-09-23    2    -0/+25
|/
|     seemingly from zenodo:
|     * https://fatcat.wiki/release/rzcpjwukobd4pj36ipla22cnoi
|     * https://doi.org/10.5281/zenodo.4041777
|     About 3400 records with "FULL MOVIE" in title, currently.
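A title check of the kind this commit note implies might look like the sketch below. The log does not show the heuristic the commit actually uses, so the function name, the case-insensitive match, and the single marker string are assumptions made for illustration.

```python
# Assumption: the only marker is the "FULL MOVIE" phrase mentioned in the commit note.
SPAM_MARKERS = ("FULL MOVIE",)


def looks_spammy(title):
    """Return True if the title contains a known spam marker (case-insensitive)."""
    if not title:
        return False
    upper = title.upper()
    return any(marker in upper for marker in SPAM_MARKERS)
```

An importer could call a check like this before creating a release and skip records that are flagged.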
* homepage: small grammar tweaks (The/the)    Bryan Newbold    2020-09-11    1    -3/+3
|
* ingest: default to crawl protocols.io DOIs    Bryan Newbold    2020-09-10    1    -0/+2
|
* Merge branch 'bnewbold-datacite-not-empty-version' into 'master'    bnewbold    2020-09-11    3    -2/+3
|\
| |     datacite: handle case of empty-string version
| |     See merge request webgroup/fatcat!83
| * datacite: handle case of empty-string version    Bryan Newbold    2020-09-10    3    -2/+3
|/
|     Includes a tiny tweak to the datacite import sample file to test this code path.
* file_meta import notes    Bryan Newbold    2020-09-04    1    -0/+75
|
* update stats snapshot    Bryan Newbold    2020-09-03    2    -0/+47
|
* remove spurious print statement    Bryan Newbold    2020-09-03    1    -1/+0
|
* Merge branch 'bnewbold-file-meta-cleanups' into 'master'    Martin Czygan    2020-09-03    3    -0/+149
|\
| |     generic file entity clean-ups as part of file_meta importer
| |     See merge request webgroup/fatcat!82
| * generic file entity clean-ups as part of file_meta importer    Bryan Newbold    2020-09-02    3    -0/+149
|/
* Merge branch 'bnewbold-filemeta'    Bryan Newbold    2020-08-27    5    -0/+162
|\
| * fix comment typo (thanks martin)    Bryan Newbold    2020-08-27    1    -1/+1
| |
| * fixes and test coverage for file_meta importer    Bryan Newbold    2020-08-21    4    -6/+82
| |
| * initial implementation of file_meta importer    Bryan Newbold    2020-08-21    3    -0/+86
| |
* | Merge branch 'bnewbold-meta-tags' into 'master'    Martin Czygan    2020-08-25    1    -2/+1
|\ \
| |/
|/|
| |     meta tags
| |     See merge request webgroup/fatcat!81
| * remove typo (isbn:) from metadata DC.language field    Bryan Newbold    2020-08-21    1    -1/+1
| |
| * remove placeholder description meta tag    Bryan Newbold    2020-08-20    1    -1/+0
|/
* Merge branch 'bnewbold-sitemap' into 'master'    bnewbold    2020-08-20    10    -7/+206
|\
| |     basic sitemap setup
| |     See merge request webgroup/fatcat!79
| * fix SearchAction nesting in WebSite (schema.org)    Bryan Newbold    2020-08-20    1    -5/+2
| |     This is not related to the sitemap changes, but I was reminded of it in Google's search tools when validating the site.