White Papers
Magical Thinking in Data Mining: Lessons From CoIL Challenge 2000
Category: Data Management
Tags: data mining, data
Overview CoIL challenge 2000 was a supervised learning contest that attracted 43 entries. The authors of 29 entries later wrote explanations of their work. This paper discusses these reports and reaches three main conclusions. First, naive Bayesian classifiers remain competitive in practice: they were used by both the winning entry and the next best entry. Second, identifying feature interactions correctly is important for maximizing predictive accuracy: this was the difference between the winning classifier and all others. Third and most important, too many researchers and practitioners in data mining do not appreciate properly the issue of statistical significance and the danger of overfitting.
- Publisher
- University of California
- File Format
- Date Published
- Oct 7, 2008
- Format
- White Papers
- Topics
- Knowledge and Data Management, Data Mining - Analysis
Similiar White Papers
Information Architecture Essentials, Part 5: Business Intelligence in Your Information Architecture
This series explores a variety of elements that create a successful information architecture design. As one manages and
Publisher: IBM | Tags: business intelligence, data, data mining
Cool New Features in SAS Enterprise Miner 5.3
SAS released Enterprise Miner 5.3 in late 2007 with a veritable plethora of cool new features for data miners everywhere
Publisher: SAS Institute | Tags: data, software
Data Mining and Knowledge Management in Higher Education - Potential Applications
This paper introduces a brand new and powerful decision support tool, data mining, in the context of knowledge managemen
Publisher: Cabrillo College | Tags: data, data mining, knowledge management, management
Profiting From Promotions Analysis
In this webcast the presenter explains how to use weekly Electronic Point Of Sale (EPOS) data from retailer extranets to
Publisher: SAS Institute | Tags: data
Educational Data Mining: A Case Study
This paper shows how using data mining algorithms can help discovering pedagogically relevant knowledge contained in dat
Publisher: University of Sydney | Tags: data, data mining
University of California White Papers
Stateless Load Balancing Over Multiple MPLS Paths
The paper proposes a flow-independent approach to balance the load coming from several multimedia applications (i.e., IP
Publisher: University of California | Tags: applications, ip, mpls, network
Escape From the Computer Lab: Education in Mobile Wireless Networks
As mobile wireless network technology becomes widespread, the importance of education about this new form of communicati
Publisher: University of California | Tags: computing, mobile wireless, mobility, network, portable devices, university of california
Parallel Spectral Clustering Algorithm for Large-Scale Community Data Mining
The spectral clustering algorithm has been shown to be very effective in finding clusters of non-linear boundaries. Unfo
Publisher: University of California
Directed Diffusion for Wireless Sensor Networking
Advances in processor, memory and radio technology will enable small and cheap nodes capable of sensing, communication a
Publisher: University of California | Tags: data, network
Mesh Topology Construction for Interconnected Wireless LANs
The 802.11s working group has been formed recently to recommend an Extended Service Set (ESS) that enables wider area co
Publisher: University of California | Tags: network
Featured white papers
-
The Value of Location Intelligence in the Communications Industry
Public Services are under pressure, the challenge is to do more with less. How do you improve citizen satisfaction, increase cost efficiencies and improve service delivery? The power of location intelligence is helping many local authorities...
-
Best Practices for Translating Customer Satisfaction into Revenue
Today's support organisations are focused on two top-level metrics: financial results and customer satisfaction. For most, it's easy to track financial performance, but customer satisfaction is akin to speaking a foreign language...
-
HP print solutions and 3M
The objective for 3M was to optimize office printing infrastructure at 3M locations worldwide, reduce total cost and environmental footprint. Some of the business benefits acheived by switching to HP print solutions...
-
Check out these top business apps for your iPhone
-
Inside a Microsoft datacentre
-
Green IT without losing your edge
-
Peter Cochrane's latest video blog
-
What you need to know about Windows 7