White Papers

Magical Thinking in Data Mining: Lessons From CoIL Challenge 2000

Category: Data Management

Tags: data mining, data

Overview CoIL challenge 2000 was a supervised learning contest that attracted 43 entries. The authors of 29 entries later wrote explanations of their work. This paper discusses these reports and reaches three main conclusions. First, naive Bayesian classifiers remain competitive in practice: they were used by both the winning entry and the next best entry. Second, identifying feature interactions correctly is important for maximizing predictive accuracy: this was the difference between the winning classifier and all others. Third and most important, too many researchers and practitioners in data mining do not appreciate properly the issue of statistical significance and the danger of overfitting.

Download White Paper

By downloading you agree to our Terms and Conditions. These include information regarding use of your personal data.

Publisher
University of California
File Format
PDF
Date Published
Oct 7, 2008
Format
White Papers
Topics
Knowledge and Data Management, Data Mining - Analysis

Similiar White Papers

Information Architecture Essentials, Part 5: Business Intelligence in Your Information Architecture

Information Architecture Essentials, Part 5: Business Intelligence in Your Information Architecture

This series explores a variety of elements that create a successful information architecture design. As one manages and

Publisher: IBM  |  Tags: business intelligence, data, data mining

Cool New Features in SAS Enterprise Miner 5.3

Cool New Features in SAS Enterprise Miner 5.3

SAS released Enterprise Miner 5.3 in late 2007 with a veritable plethora of cool new features for data miners everywhere

Publisher: SAS Institute  |  Tags: data, software

Data Mining and Knowledge Management in Higher Education - Potential Applications

Data Mining and Knowledge Management in Higher Education - Potential Applications

This paper introduces a brand new and powerful decision support tool, data mining, in the context of knowledge managemen

Publisher: Cabrillo College  |  Tags: data, data mining, knowledge management, management

Profiting From Promotions Analysis

Profiting From Promotions Analysis

In this webcast the presenter explains how to use weekly Electronic Point Of Sale (EPOS) data from retailer extranets to

Publisher: SAS Institute  |  Tags: data

Educational Data Mining: A Case Study

Educational Data Mining: A Case Study

This paper shows how using data mining algorithms can help discovering pedagogically relevant knowledge contained in dat

Publisher: University of Sydney  |  Tags: data, data mining

University of California White Papers

Stateless Load Balancing Over Multiple MPLS Paths

Stateless Load Balancing Over Multiple MPLS Paths

The paper proposes a flow-independent approach to balance the load coming from several multimedia applications (i.e., IP

Publisher: University of California  |  Tags: applications, ip, mpls, network

Escape From the Computer Lab: Education in Mobile Wireless Networks

Escape From the Computer Lab: Education in Mobile Wireless Networks

As mobile wireless network technology becomes widespread, the importance of education about this new form of communicati

Publisher: University of California  |  Tags: computing, mobile wireless, mobility, network, portable devices, university of california

Parallel Spectral Clustering Algorithm for Large-Scale Community Data Mining

Parallel Spectral Clustering Algorithm for Large-Scale Community Data Mining

The spectral clustering algorithm has been shown to be very effective in finding clusters of non-linear boundaries. Unfo

Publisher: University of California

Directed Diffusion for Wireless Sensor Networking

Directed Diffusion for Wireless Sensor Networking

Advances in processor, memory and radio technology will enable small and cheap nodes capable of sensing, communication a

Publisher: University of California  |  Tags: data, network

Mesh Topology Construction for Interconnected Wireless LANs

Mesh Topology Construction for Interconnected Wireless LANs

The 802.11s working group has been formed recently to recommend an Extended Service Set (ESS) that enables wider area co

Publisher: University of California  |  Tags: network