Changelog
manydist 0.5.0
CRAN release: 2026-06-09
Major changes
- Expanded
manydistfrom a package focused on mixed-type distance construction to a broader framework for distance-based learning with mixed-type data. - Updated the package title and description to reflect support for distance construction, distance-based modelling workflows, variable-importance diagnostics, and clustering.
- Changed the package maintainer from Angelos Markos to Alfonso Iodice D’Enza.
Distance construction
- Added a revised
mdist()interface and documentation for mixed-type distance construction. - Added support for additional mixed-type distance specifications and presets.
- Added response-aware distance construction tools for supervised mixed-type workflows.
- Added interaction-aware distance components for continuous-categorical relationships.
- Added helper infrastructure for preprocessing and applying mixed-type distance specifications consistently across training and new data.
- Added utilities for generating and benchmarking distance-method specifications.
Distance-based learning workflows
- Added
step_mdist()for integratingmanydistdistances intorecipesand tidymodels workflows. - Added
nearest_neighbor_dist()and related prediction functions for nearest-neighbour models based on precomputed or manydist-generated distances. - Added
pam_dist()for partitioning around medoids using manydist dissimilarities. - Added
spectral_dist()andspectral_from_dist()for spectral clustering from distance matrices. - Added support functions for converting distances to affinities and fitting distance-based clustering models.
Variable importance and diagnostics
- Added
lovo_mdist()for leave-one-variable-out diagnostics of distance matrices. - Added
compare_lovo_mdist()andlovo_method_spec()for comparing LOVO diagnostics across multiple distance specifications. - Added congruence- and alienation-based diagnostics for comparing multidimensional scaling configurations.
- Added optional clustering-based LOVO diagnostics using PAM, hierarchical clustering, and spectral clustering.
Data generation and benchmarking
- Added
gen_mixed()andgenerate_dataset()for generating mixed-type example and simulation data. - Added
benchmark_mdist()for benchmarking distance specifications across datasets and method grids. - Added
all_dist_method_specs()and distance-method metadata helpers.