Mode | Name | Size | |
---|---|---|---|
-rw-r--r-- | backfill_scalding_rewrite.txt | 715 | logstatsplain |
-rw-r--r-- | crawl_cdx_merge.md | 751 | logstatsplain |
-rw-r--r-- | dryad_datasets.md | 658 | logstatsplain |
d--------- | examples | 203 | logstatsplain |
-rw-r--r-- | fuzzy_match_notes.md | 4217 | logstatsplain |
-rw-r--r-- | grobid_munging.txt | 3043 | logstatsplain |
-rw-r--r-- | hadoop_job_log.md | 9011 | logstatsplain |
-rw-r--r-- | hbase_table_sizes.txt | 179 | logstatsplain |
-rw-r--r-- | html_ingest_notes.md | 11627 | logstatsplain |
d--------- | ingest | 2326 | logstatsplain |
-rw-r--r-- | ingest_domains.txt | 13476 | logstatsplain |
-rw-r--r-- | library_shopping.txt | 353 | logstatsplain |
-rw-r--r-- | match_filter_enrich.txt | 1367 | logstatsplain |
-rw-r--r-- | old_extract_results.txt | 1361 | logstatsplain |
-rw-r--r-- | petabox_ia_metadata.txt | 2022 | logstatsplain |
-rw-r--r-- | possible_ingest_targets.txt | 898 | logstatsplain |
d--------- | tasks | 792 | logstatsplain |
-rw-r--r-- | url_pattern_heuristic_backfill.txt | 4132 | logstatsplain |
-rw-r--r-- | url_pattern_heuristic_verification.txt | 3637 | logstatsplain |