Changelog
Source:NEWS.md
edstr 0.3.0 (2026-05-02)
Breaking changes
edstr_import()andedstr_clean()now save cache files as Parquet (.parquet) instead of RDS (.rds). Existing.rdscache files from earlier versions will not be detected and must be deleted or regenerated.edstr_extract()no longer saves a.csvfile. Results are now saved as.xlsx,.json, and.rds.
Internal changes
- Code style unified across all R files with
airformatter. - Pre-commit hooks added:
air-format(posit-dev/air-pre-commit v0.9.0) andjarl-check(etiennebacher/jarl-pre-commit v0.4.0), both scoped to.Rfiles. -
.Rbuildignoreregex patterns fixed (^air\.toml$, added^jarl\.toml$). -
.Rprofileremoved (JVM auto-detection viafindno longer needed). -
DESCRIPTION: migrated from legacyRoxygen/RoxygenNotefields toConfig/roxygen2/markdownandConfig/roxygen2/version. -
dplyr::if_anyanddplyr::sliceadded to explicit imports.
edstr 0.2.0 (2026-03-26)
Breaking changes
All CLI messages, error messages, and interactive menus are now in English.
edstr_view(): theidargument now defaults toNULLinstead of auto-detecting at call time. The column is still auto-detected internally, but this avoids errors whendatais not yet available at function definition.
New features
edstr_extract()gains anexclus_auto_token_minargument (default10) to control the minimum n-gram size for automatic exclusion heuristics. Previously this was hard-coded.edstr_config()now validates all arguments on input and raises informative errors for wrong types or lengths.edstr_clean()now validates thereplaceargument structure (must be a named character vector or list of named character vectors) and checks that the text column exists indatabefore processing.