Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | remove fetch.sh; gitignore at top-level | Bryan Newbold | 2020-06-02 | 3 | -49/+1 | |
| | ||||||
* | fixes for KBART import | Bryan Newbold | 2020-06-02 | 3 | -9/+17 | |
| | ||||||
* | add KBART parsing/importing | Bryan Newbold | 2020-06-02 | 4 | -63/+169 | |
| | ||||||
* | refactor main commands | Bryan Newbold | 2020-06-02 | 1 | -59/+77 | |
| | ||||||
* | warn, but don't skip mangled SIM years | Bryan Newbold | 2020-06-02 | 1 | -3/+2 | |
| | ||||||
* | rename summary template | Bryan Newbold | 2020-06-02 | 2 | -0/+0 | |
| | ||||||
* | fix tests and type annotations | Bryan Newbold | 2020-06-01 | 4 | -23/+58 | |
| | ||||||
* | 'everything' at least partially working | Bryan Newbold | 2020-06-01 | 10 | -180/+284 | |
| | ||||||
* | add missing test files | Bryan Newbold | 2020-05-07 | 2 | -0/+40 | |
| | | | | These were gitignored | |||||
* | update ROMEO tests to join with policies and have ISSN-L mappings | Bryan Newbold | 2020-05-07 | 3 | -2/+44 | |
| | ||||||
* | WIP (commented out) on Makefile | Bryan Newbold | 2020-05-07 | 1 | -13/+59 | |
| | ||||||
* | sources in TOML markup | Bryan Newbold | 2020-05-07 | 1 | -0/+123 | |
| | ||||||
* | update code to work with new config structure | Bryan Newbold | 2020-05-07 | 14 | -19/+20 | |
| | ||||||
* | nice simple hack for config loading | Bryan Newbold | 2020-05-07 | 1 | -26/+17 | |
| | ||||||
* | pipenv: toml | Bryan Newbold | 2020-05-07 | 2 | -33/+35 | |
| | ||||||
* | start of basic tests | Bryan Newbold | 2020-05-07 | 15 | -0/+641 | |
| | ||||||
* | update test/lint scope | Bryan Newbold | 2020-05-07 | 2 | -1/+3 | |
| | ||||||
* | move fix_invalid_issnl from old/ to extra/ | Bryan Newbold | 2020-05-07 | 1 | -0/+0 | |
| | ||||||
* | move journal summary files to reports folder | Bryan Newbold | 2020-05-07 | 2 | -0/+1018 | |
| | ||||||
* | schema: removed unused/commented lines | Bryan Newbold | 2020-05-07 | 1 | -19/+0 | |
| | ||||||
* | start a Makefile | Bryan Newbold | 2020-05-07 | 19 | -580/+1039 | |
| | | | | | | | | | | Move all "index" functions into classes, each in a separate file. Add lots of type annotations. Use dataclass objects to hold database rows. This aspect will need further refactoring to remove "extra" usage, probably by adding database rows to align with DatabaseInfo more closely. | |||||
* | pytest config | Bryan Newbold | 2020-05-06 | 1 | -0/+10 | |
| | ||||||
* | gitlab-ci first attempt | Bryan Newbold | 2020-05-06 | 1 | -0/+15 | |
| | ||||||
* | rename chocula.database | Bryan Newbold | 2020-05-06 | 2 | -1/+1 | |
| | ||||||
* | start refactoring files into module | Bryan Newbold | 2020-05-06 | 7 | -458/+470 | |
| | ||||||
* | pipenv: py37, black, mypy | Bryan Newbold | 2020-05-06 | 2 | -227/+226 | |
| | ||||||
* | update to new(er) ISSN-L mapping file | Bryan Newbold | 2020-05-01 | 2 | -2/+2 | |
| | ||||||
* | move queries list to sqlite-notebook report format | Bryan Newbold | 2019-12-26 | 4 | -116/+1375 | |
| | ||||||
* | update URL crawl status snapshot | Bryan Newbold | 2019-12-26 | 2 | -5/+2 | |
| | ||||||
* | add check to container stat fetch to ensure valid JSON returned | Bryan Newbold | 2019-12-26 | 1 | -1/+1 | |
| | ||||||
* | add stats and URL crawl status files | Bryan Newbold | 2019-12-24 | 2 | -2/+6 | |
| | ||||||
* | count chocula logo (yay) | Bryan Newbold | 2019-12-24 | 1 | -0/+0 | |
| | ||||||
* | example queries to run on sqlite | Bryan Newbold | 2019-12-24 | 2 | -0/+64 | |
| | ||||||
* | update README with better directions | Bryan Newbold | 2019-12-24 | 2 | -16/+48 | |
| | ||||||
* | move old scripts into subdirectory | Bryan Newbold | 2019-12-23 | 3 | -0/+0 | |
| | ||||||
* | update chocula usage of argparse | Bryan Newbold | 2019-12-23 | 1 | -14/+22 | |
| | ||||||
* | update norwegian CSV importer schema | Bryan Newbold | 2019-12-23 | 1 | -2/+4 | |
| | ||||||
* | update chocula input data files | Bryan Newbold | 2019-12-23 | 3 | -38/+35 | |
| | | | | | Including updating fetch script, README links, and chocula.py path references. | |||||
* | use newer fatcat contianer dump | Bryan Newbold | 2019-09-06 | 2 | -1/+3 | |
| | ||||||
* | filter out bad ISSN{e,p} | Bryan Newbold | 2019-09-06 | 1 | -0/+5 | |
| | | | | | Unfortunately a few hundred of these got pushed into fatcat already; will probably fix with a new fixer bot tool. | |||||
* | last name/publisher cleanups | Bryan Newbold | 2019-09-03 | 1 | -2/+6 | |
| | ||||||
* | update TODO | Bryan Newbold | 2019-09-03 | 1 | -1/+10 | |
| | ||||||
* | don't include doaj.org or NCBI homepage URLs | Bryan Newbold | 2019-09-03 | 1 | -0/+4 | |
| | ||||||
* | improve fatcat_export metadata quality | Bryan Newbold | 2019-09-03 | 1 | -3/+12 | |
| | ||||||
* | fix SZCEPANSKI typo | Bryan Newbold | 2019-09-03 | 1 | -2/+2 | |
| | ||||||
* | improve export_fatcat | Bryan Newbold | 2019-08-28 | 1 | -5/+22 | |
| | ||||||
* | python script to fix fatcat ISSN-Ls | Bryan Newbold | 2019-08-27 | 1 | -0/+75 | |
| | ||||||
* | hand-coded corrections to invalid fatcat ISSN-Ls | Bryan Newbold | 2019-08-27 | 1 | -88/+88 | |
| | ||||||
* | current invalid fatcat ISSN-Ls | Bryan Newbold | 2019-08-27 | 1 | -0/+118 | |
| | | | | | AKA, list of fatcat containers with an ISSN-L that isn't a valid ISSN (based on checksum) | |||||
* | only fatcat_export 'valid' (syntax) ISSN-Ls | Bryan Newbold | 2019-08-27 | 1 | -1/+1 | |
| |