Department of Mathematics
 Search | Help | Login | pdf version | printable version

Math @ Duke



Publications [#328806] of Mauro Maggioni

Papers Published

  1. Tomita, TM; Maggioni, M; Vogelstein, JT, ROFLMAO: Robust oblique forests with linear MAtrix operations, Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017 (January, 2017), pp. 498-506, ISBN 9781611974874
    (last updated on 2017/12/10)

    Copyright © by SIAM. Random Forest (RF) remains one of the most widely used general purpose classification methods. Two recent largescale empirical studies demonstrated it to be the best overall classification method among a variety of methods evaluated. One of its main limitations, however, is that it is restricted to only axis-aligned recursive partitions of the feature space. Consequently, RF is particularly sensitive to the orientation of the data. Several studies have proposed "oblique" decision forest methods to address this limitation. However, these methods either have a time and space complexity significantly greater than RF, are sensitive to unit and scale, or empirically do not perform as well as RF on real data. One promising oblique method that was proposed alongside the canonical RF method, called Forest-RC (F-RC), has not received as much attention by the community. Despite it being just as old as RF, virtually no studies exist investigating its theoretical or empirical performance. In this work, we demonstrate that F-RC empirically outperforms RF and another recently proposed oblique method called Random Rotation Random Forest, while approximately maintaining the same computational complexity. Furthermore, a variant of F-RC which rank transforms the data prior to learning is especially invariant to affine transformations and robust to data corruption. Open source code is available.
ph: 919.660.2800
fax: 919.660.2821

Mathematics Department
Duke University, Box 90320
Durham, NC 27708-0320