| Commit message (Collapse) | Author | Age | Files | Lines | |
|---|---|---|---|---|---|
| * | add wikidata SPARQL queryx-attic-chocula | Bryan Newbold | 2019-07-31 | 1 | -0/+35 |
| | | |||||
| * | sqlite-notebook template for basic chocula stats | Bryan Newbold | 2019-07-31 | 2 | -0/+186 |
| | | |||||
| * | iterate on homepage url import/stats | Bryan Newbold | 2019-07-31 | 2 | -21/+43 |
| | | |||||
| * | more issn URL checker fixes | Bryan Newbold | 2019-07-31 | 2 | -11/+27 |
| | | |||||
| * | major improvements to ISSN URL checker | Bryan Newbold | 2019-07-30 | 1 | -20/+121 |
| | | |||||
| * | import vanilla ISSN url checker script | Bryan Newbold | 2019-07-30 | 1 | -0/+52 |
| | | |||||
| * | chocula: sherpa_color in summary; cleanups | Bryan Newbold | 2019-07-30 | 3 | -6/+12 |
| | | |||||
| * | chocula: openapc | Bryan Newbold | 2019-07-30 | 1 | -1/+31 |
| | | |||||
| * | chocula: json export | Bryan Newbold | 2019-07-30 | 1 | -0/+17 |
| | | |||||
| * | chocula: fix wikidata_qid inclusion | Bryan Newbold | 2019-07-30 | 1 | -2/+3 |
| | | |||||
| * | chocula: fix wikidata_qid inclusion | Bryan Newbold | 2019-07-30 | 2 | -1/+3 |
| | | |||||
| * | chocula: better ISSN-L handling | Bryan Newbold | 2019-07-30 | 4 | -24/+41 |
| | | |||||
| * | chocula: updated fetches, new ISSN-L and DOAJ files | Bryan Newbold | 2019-07-30 | 2 | -7/+10 |
| | | |||||
| * | chocula: wikidata indexing | Bryan Newbold | 2019-07-30 | 1 | -4/+48 |
| | | |||||
| * | chocula: crude publisher type bucketing; field cleanup | Bryan Newbold | 2019-07-30 | 2 | -40/+194 |
| | | |||||
| * | shorter/simpler table names | Bryan Newbold | 2019-07-26 | 2 | -9/+17 |
| | | |||||
| * | chocula: more host/domain fixes | Bryan Newbold | 2019-07-26 | 1 | -3/+8 |
| | | |||||
| * | GOLD OA parsing | Bryan Newbold | 2019-07-26 | 1 | -40/+54 |
| | | |||||
| * | chocula: fix domain parsing | Bryan Newbold | 2019-07-26 | 1 | -10/+47 |
| | | |||||
| * | pipenv: pytest for journal_metadata | Bryan Newbold | 2019-07-26 | 2 | -4/+83 |
| | | |||||
| * | chocula README | Bryan Newbold | 2019-07-14 | 1 | -0/+7 |
| | | |||||
| * | chocula: fetch SZ json | Bryan Newbold | 2019-07-14 | 1 | -0/+2 |
| | | |||||
| * | more chocula progress | Bryan Newbold | 2019-07-14 | 2 | -61/+183 |
| | | |||||
| * | EZB and szczepanski indexers | Bryan Newbold | 2019-07-11 | 1 | -45/+146 |
| | | |||||
| * | chocula early work | Bryan Newbold | 2019-07-10 | 4 | -0/+1009 |
| | | | | | (non-functional) | ||||
| * | more fixup notes (from QA server) | Bryan Newbold | 2019-06-27 | 1 | -5/+46 |
| | | |||||
| * | finish fixup_longtail_issnl_unique; but not going to run it | Bryan Newbold | 2019-06-27 | 1 | -4/+3 |
| | | |||||
| * | initial work on longtail_issnl_unique.py | Bryan Newbold | 2019-06-24 | 1 | -0/+192 |
| | | |||||
| * | stats.json update after releases v03 cut-over | Bryan Newbold | 2019-06-06 | 1 | -0/+1 |
| | | |||||
| * | elasticsearch index alias howto | Bryan Newbold | 2019-06-06 | 1 | -1/+16 |
| | | |||||
| * | QA checks (for hash, extid duplication) | Bryan Newbold | 2019-06-04 | 4 | -0/+82 |
| | | |||||
| * | recent prod table sizes; 380 GBytes or so total | Bryan Newbold | 2019-06-04 | 1 | -0/+233 |
| | | |||||
| * | dump_release_extid.sql changes for new schema | Bryan Newbold | 2019-06-03 | 1 | -1/+1 |
| | | |||||
| * | move export README info to sql_dumps doc | Bryan Newbold | 2019-06-03 | 1 | -1/+29 |
| | | |||||
| * | fix parse_merge_metadata.py merge_spans() | Bryan Newbold | 2019-05-30 | 1 | -4/+8 |
| | | |||||
| * | better KBART merging | Bryan Newbold | 2019-05-30 | 1 | -4/+5 |
| | | |||||
| * | initial code to handle multiple KBART spans better | Bryan Newbold | 2019-05-30 | 1 | -2/+64 |
| | | |||||
| * | add work-in-progress elastic index notes | Bryan Newbold | 2019-05-30 | 1 | -0/+11 |
| | | |||||
| * | add 'superceded' release extra flag to elastic schema | Bryan Newbold | 2019-05-23 | 1 | -0/+1 |
| | | |||||
| * | also track work_id in release elasticsearch table | Bryan Newbold | 2019-05-22 | 1 | -0/+1 |
| | | |||||
| * | count linked refs (not just raw refs) in elasticsearch | Bryan Newbold | 2019-05-22 | 1 | -0/+1 |
| | | |||||
| * | commit SQL table stats scripts | Bryan Newbold | 2019-05-21 | 2 | -0/+36 |
| | | |||||
| * | include creator_ids in release elastic schema | Bryan Newbold | 2019-05-20 | 1 | -0/+1 |
| | | | | | Intent is to allow fast creator search/lookup | ||||
| * | elastic release schema update | Bryan Newbold | 2019-05-20 | 1 | -1/+6 |
| | | |||||
| * | start tracking stats | Bryan Newbold | 2019-05-07 | 2 | -0/+2 |
| | | |||||
| * | IA collection page embed example description | Bryan Newbold | 2019-05-07 | 1 | -0/+45 |
| | | | | | This code has some issues, but is worth commiting | ||||
| * | old fileset and webcapture example entities | Bryan Newbold | 2019-04-30 | 2 | -0/+146 |
| | | |||||
| * | no-derive metadata and SQL dump uploads (to petabox) | Bryan Newbold | 2019-04-30 | 1 | -0/+2 |
| | | |||||
| * | faster elasticsearch imports | Bryan Newbold | 2019-04-30 | 1 | -1/+1 |
| | | |||||
| * | more bots to bootstrap | Bryan Newbold | 2019-04-24 | 1 | -0/+15 |
| | | |||||
