| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | make fmt | Bryan Newbold | 2021-04-23 | 1 | -1/+5 |
* | database support for scholarsportal and cariniana preservation holdings | Bryan Newbold | 2020-10-08 | 1 | -0/+2 |
* | do not create hathitrust-only journal rows | Bryan Newbold | 2020-09-02 | 1 | -1/+2 |
* | hathitrust KBART-style importer | Bryan Newbold | 2020-09-02 | 1 | -1/+8 |
* | include pkp_pln as a kbart directory in summarization/export/etc | Bryan Newbold | 2020-08-31 | 1 | -1/+1 |
* | fmt | Bryan Newbold | 2020-08-31 | 1 | -8/+21 |
* | fatcat export improvements | Bryan Newbold | 2020-08-03 | 1 | -9/+28 |
* | more blocked URLs and domains | Bryan Newbold | 2020-08-03 | 1 | -0/+29 |
* | directories: all extra metadata in top-level dict | Bryan Newbold | 2020-08-03 | 1 | -7/+3 |
| | Had been using slug-specific sub-objects, but this was too confusing. | | | | |
* | skip umi.com in addition to www.umi.com | Bryan Newbold | 2020-06-23 | 1 | -0/+1 |
* | ensure lang is len()==2; prep for original_name column | Bryan Newbold | 2020-06-23 | 1 | -0/+5 |
* | block/skip more homepage patterns | Bryan Newbold | 2020-06-23 | 1 | -0/+9 |
* | fix langs inclusion in summarization; remove unused/duplicate fields | Bryan Newbold | 2020-06-23 | 1 | -2/+2 |
* | set is_active flag based on directories | Bryan Newbold | 2020-06-23 | 1 | -0/+5 |
* | filter out more meta/index URL hosts | Bryan Newbold | 2020-06-23 | 1 | -1/+15 |
* | Revert "EZB color not a good proxy for OA status" | Bryan Newbold | 2020-06-23 | 1 | -0/+2 |
| | I think this actually is OK in the context of identifying longtail journals. We don't set the `is_oa` flag in release metadata based on this chocula flag. This reverts commit 9ba5b2e307c7f61f60304ba104bf3cc8424b7163. | | | | |
* | be more careful with sherpa/romeo color summarization | Bryan Newbold | 2020-06-22 | 1 | -3/+4 |
* | EZB color not a good proxy for OA status | Bryan Newbold | 2020-06-22 | 1 | -2/+0 |
* | flake8 cleanups | Bryan Newbold | 2020-06-22 | 1 | -3/+1 |
* | fmt (black) | Bryan Newbold | 2020-06-22 | 1 | -248/+356 |
* | remove un-necessary list() in iteration | Bryan Newbold | 2020-06-22 | 1 | -1/+1 |
* | use and pass-through 'platform' extra metadata | Bryan Newbold | 2020-06-11 | 1 | -4/+7 |
* | add KBART parsing/importing | Bryan Newbold | 2020-06-02 | 1 | -51/+9 |
* | fix tests and type annotations | Bryan Newbold | 2020-06-01 | 1 | -22/+21 |
* | 'everything' at least partially working | Bryan Newbold | 2020-06-01 | 1 | -107/+35 |
* | update code to work with new config structure | Bryan Newbold | 2020-05-07 | 1 | -2/+2 |
* | start a Makefile | Bryan Newbold | 2020-05-07 | 1 | -499/+254 |
| | Move all "index" functions into classes, each in a separate file. Add lots of type annotations. Use dataclass objects to hold database rows. This aspect will need further refactoring to remove "extra" usage, probably by adding database rows to align with DatabaseInfo more closely. | | | | |
* | rename chocula.database | Bryan Newbold | 2020-05-06 | 1 | -0/+1015 |