| Commit message | Author | Age | Files | Lines |
| |
Includes removing index document names (use '_doc' instead during the
transition).
| |
This will increase index size (URLs are often long in our corpus, and we
have many file entities), but it seems worth it.
Initially added `ia_url` as a second field, guaranteed to always be an
*.archive.org URL, but `best_url` defaults to that anyway, so it didn't
seem worthwhile.
| |
But not applying asciifolding; don't see any need to do so?
|
Includes a trivial test and transform, but not any workers or doc
updates.
|