Fast and multi-threaded implementation of isolation forest (Liu, Ting, Zhou (2008) doi:10.1109/ICDM.2008.17), extended isolation forest (Hariri, Kind, Brunner (2018)), SCiForest (Liu, Ting, Zhou (2010) doi:10.1007/978-3-642-15883-4_18), fair-cut forest (Cortes (2021)), robust random-cut forest (Guha, Mishra, Roy, Schrijvers (2016)), and customizable variations of them, for isolation-based outlier detection, clustered outlier detection, distance or similarity approximation (Cortes (2019)), isolation kernel calculation (Ting, Zhu, Zhou (2018) doi:10.1145/3219819.3219990), and imputation of missing values (Cortes (2019)), based on random or guided decision tree splitting, and providing different metrics for scoring anomalies based on isolation depth or density (Cortes (2021)). Provides simple heuristics for fitting the model to categorical columns and handling missing data, and offers options for varying between random and guided splits and for using different splitting criteria.
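To illustrate what scoring anomalies by isolation depth means, here is a minimal sketch in plain Python of the basic isolation forest idea from Liu, Ting, Zhou (2008): points that random splits separate from the rest after only a few cuts receive scores closer to 1. This is not this package's API or implementation. Every name below (`expected_depth`, `build_tree`, `isolation_depth`, `fit_forest`, `anomaly_score`) is hypothetical, and the sketch assumes purely random single-column splits on numeric data with no missing values, no categorical columns, and no multi-threading.

```python
import math
import random

EULER_GAMMA = 0.5772156649015329


def expected_depth(n):
    """Average depth c(n) of an unsuccessful search in a binary tree of n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    return 2.0 * (math.log(n - 1) + EULER_GAMMA) - 2.0 * (n - 1) / n


def build_tree(points, depth, max_depth, rng):
    """Recursively split on a random column at a uniformly random threshold."""
    if depth >= max_depth or len(points) <= 1:
        return {"size": len(points)}
    col = rng.randrange(len(points[0]))
    lo = min(p[col] for p in points)
    hi = max(p[col] for p in points)
    if lo == hi:
        # Simplification: stop here instead of retrying with another column.
        return {"size": len(points)}
    cut = rng.uniform(lo, hi)
    left = [p for p in points if p[col] < cut]
    right = [p for p in points if p[col] >= cut]
    return {
        "col": col,
        "cut": cut,
        "left": build_tree(left, depth + 1, max_depth, rng),
        "right": build_tree(right, depth + 1, max_depth, rng),
    }


def isolation_depth(tree, x):
    """Depth at which x reaches a leaf, plus c(size) for points left unsplit there."""
    depth = 0
    while "col" in tree:
        tree = tree["left"] if x[tree["col"]] < tree["cut"] else tree["right"]
        depth += 1
    return depth + expected_depth(tree["size"])


def fit_forest(points, n_trees=100, sample_size=256, seed=0):
    """Grow n_trees trees, each on a random sub-sample, with depth capped at log2(sample_size)."""
    if len(points) < 2:
        raise ValueError("need at least two points")
    rng = random.Random(seed)
    sample_size = min(sample_size, len(points))
    max_depth = math.ceil(math.log2(sample_size))
    trees = [
        build_tree(rng.sample(points, sample_size), 0, max_depth, rng)
        for _ in range(n_trees)
    ]
    return trees, sample_size


def anomaly_score(forest, x):
    """Standardized score 2^(-mean depth / c(sample_size)); higher means more anomalous."""
    trees, sample_size = forest
    mean_depth = sum(isolation_depth(t, x) for t in trees) / len(trees)
    return 2.0 ** (-mean_depth / expected_depth(sample_size))


# Usage: a Gaussian cluster around the origin plus one far-away point.
data_rng = random.Random(42)
data = [(data_rng.gauss(0, 1), data_rng.gauss(0, 1)) for _ in range(200)]
data.append((8.0, 8.0))
forest = fit_forest(data)
outlier_score = anomaly_score(forest, (8.0, 8.0))
inlier_score = anomaly_score(forest, (0.0, 0.0))
```

In this sketch the isolated point at (8, 8) should score higher than a point at the center of the cluster, because it is usually cut off within the first few splits. The guided splits, density-based metrics, and other variants listed above replace the random column and threshold choice or the scoring formula, and are not covered here.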
URL | github.com/david-cortes/isotree |
Copyright | see file COPYRIGHTS |