Commit message | Author | Age | Files | Lines
...
* nice simple hack for config loading | Bryan Newbold | 2020-05-07 | 1 | -26/+17
* pipenv: toml | Bryan Newbold | 2020-05-07 | 2 | -33/+35
* start of basic tests | Bryan Newbold | 2020-05-07 | 15 | -0/+641
* update test/lint scope | Bryan Newbold | 2020-05-07 | 2 | -1/+3
* move fix_invalid_issnl from old/ to extra/ | Bryan Newbold | 2020-05-07 | 1 | -0/+0
* move journal summary files to reports folder | Bryan Newbold | 2020-05-07 | 2 | -0/+1018
* schema: removed unused/commented lines | Bryan Newbold | 2020-05-07 | 1 | -19/+0
* start a Makefile | Bryan Newbold | 2020-05-07 | 19 | -580/+1039
* pytest config | Bryan Newbold | 2020-05-06 | 1 | -0/+10
* gitlab-ci first attempt | Bryan Newbold | 2020-05-06 | 1 | -0/+15
* rename chocula.database | Bryan Newbold | 2020-05-06 | 2 | -1/+1
* start refactoring files into module | Bryan Newbold | 2020-05-06 | 7 | -458/+470
* pipenv: py37, black, mypy | Bryan Newbold | 2020-05-06 | 2 | -227/+226
* update to new(er) ISSN-L mapping file | Bryan Newbold | 2020-05-01 | 2 | -2/+2
* move queries list to sqlite-notebook report format | Bryan Newbold | 2019-12-26 | 4 | -116/+1375
* update URL crawl status snapshot | Bryan Newbold | 2019-12-26 | 2 | -5/+2
* add check to container stat fetch to ensure valid JSON returned | Bryan Newbold | 2019-12-26 | 1 | -1/+1
* add stats and URL crawl status files | Bryan Newbold | 2019-12-24 | 2 | -2/+6
* count chocula logo (yay) | Bryan Newbold | 2019-12-24 | 1 | -0/+0
* example queries to run on sqlite | Bryan Newbold | 2019-12-24 | 2 | -0/+64
* update README with better directions | Bryan Newbold | 2019-12-24 | 2 | -16/+48
* move old scripts into subdirectory | Bryan Newbold | 2019-12-23 | 3 | -0/+0
* update chocula usage of argparse | Bryan Newbold | 2019-12-23 | 1 | -14/+22
* update norwegian CSV importer schema | Bryan Newbold | 2019-12-23 | 1 | -2/+4
* update chocula input data files | Bryan Newbold | 2019-12-23 | 3 | -38/+35
* use newer fatcat contianer dump | Bryan Newbold | 2019-09-06 | 2 | -1/+3
* filter out bad ISSN{e,p} | Bryan Newbold | 2019-09-06 | 1 | -0/+5
* last name/publisher cleanups | Bryan Newbold | 2019-09-03 | 1 | -2/+6
* update TODO | Bryan Newbold | 2019-09-03 | 1 | -1/+10
* don't include doaj.org or NCBI homepage URLs | Bryan Newbold | 2019-09-03 | 1 | -0/+4
* improve fatcat_export metadata quality | Bryan Newbold | 2019-09-03 | 1 | -3/+12
* fix SZCEPANSKI typo | Bryan Newbold | 2019-09-03 | 1 | -2/+2
* improve export_fatcat | Bryan Newbold | 2019-08-28 | 1 | -5/+22
* python script to fix fatcat ISSN-Ls | Bryan Newbold | 2019-08-27 | 1 | -0/+75
* hand-coded corrections to invalid fatcat ISSN-Ls | Bryan Newbold | 2019-08-27 | 1 | -88/+88
* current invalid fatcat ISSN-Ls | Bryan Newbold | 2019-08-27 | 1 | -0/+118
* only fatcat_export 'valid' (syntax) ISSN-Ls | Bryan Newbold | 2019-08-27 | 1 | -1/+1
* include Szczepanski in everything command (oops) | Bryan Newbold | 2019-08-27 | 1 | -0/+1
* updated crossref title file; ISSN-L file link | Bryan Newbold | 2019-08-27 | 3 | -3/+3
* update IA_CRAWL_FILE | Bryan Newbold | 2019-07-31 | 1 | -1/+1
* commit TODO list | Bryan Newbold | 2019-07-31 | 1 | -0/+37
* update fetch.sh with url_status files | Bryan Newbold | 2019-07-31 | 1 | -0/+3
* webarchive_urls separate from regular URLs | Bryan Newbold | 2019-07-31 | 1 | -1/+21
* don't return 'error' for bad CDX lookups | Bryan Newbold | 2019-07-31 | 1 | -1/+3
* add 'export_fatcat' | Bryan Newbold | 2019-07-31 | 1 | -1/+51
* README update | Bryan Newbold | 2019-07-31 | 1 | -21/+35
* more check_issn_urls corner-cases | Bryan Newbold | 2019-07-31 | 1 | -1/+5
* handle 'ttp://' URL prefix corner case | Bryan Newbold | 2019-07-31 | 1 | -0/+2
* broader top-level gitignore | Bryan Newbold | 2019-07-31 | 1 | -0/+25
* remove python 3.5 constraint | Bryan Newbold | 2019-07-31 | 2 | -6/+4