The methods for finding the bounds within which this combination is invoked have been re-designed to make them both faster and more effective. This option can lead to noticeably better predictive performance and is now recommended for applications with many continuous attributes.
See5 (Windows 2000/Xp/Vista) and its Unix counterpart C5.0 are sophisticated data mining tools for discovering patterns that delineate categories, assembling them into classifiers, and using them to make predictions.
Some important features:
- • See5/C5.0 has been designed to analyze substantial databases containing thousands to hundreds of thousands of records and tens to hundreds of numeric, time, date, or nominal fields. See5/C5.0 also takes advantage of processors with quad cores, up to four CPUs, or Intel Hyper-Threading to speed up the analysis.
- • To maximize interpretability, See5/C5.0 classifiers are expressed as decision trees or sets of if-then rules, forms that are generally easier to understand than neural networks.
- • See5/C5.0 is available for Windows 2000/Xp/Vista and Linux.
- See5/C5.0 is easy to use and does not presume any special knowledge of Statistics or Machine Learning (although these don't hurt, either!)
- • RuleQuest provides C source code so that classifiers constructed by See5/C5.0 can be embedded in your organization's own systems.