White Papers

Characterizing Web Spam Using Content and HTTP Session Analysis

Category: Security

Tags: spam, ip

Overview Web spam research has been hampered by a lack of statistically significant collections. This paper performs the first large-scale characterization of web spam using content and HTTP session analysis techniques on the Webb Spam Corpus - a collection of about 350,000 web spam pages. Their content analysis results are consistent with the hypothesis that web spam pages are different from normal web pages, showing far more duplication of physical content and URL redirections. An analysis of session information collected during the crawling of the Webb Spam Corpus shows significant concentration of hosting IP addresses in two narrow ranges as well as significant overlaps among session header values.

Download White Paper

By downloading you agree to our Terms and Conditions. These include information regarding use of your personal data.

Publisher
Georgia Institute of Technology
File Format
PDF
Date Published
Jul 1, 2009
Format
White Papers
Topics
Spam - E-mail Fraud - Phishing, Network Security, Security Management

Similiar White Papers

Top five strategies for combating modern threats: Is anti-virus dead?

Top five strategies for combating modern threats: Is anti-virus dead?

Today's fast, targeted, silent threats take advantage of the open network and new technologies that support an increasin

Publisher: Sophos  |  Tags: email, malware, network

Gain a Competitive Advantage by Aligning Your IT Infrastructure with Business Objectives

Gain a Competitive Advantage by Aligning Your IT Infrastructure with Business Objectives

This paper looks at what IT Security means to your company and how services can assist in the battle against the threats

Publisher: IBM

Sophos Email Security and Control - Free 30 Day Trial

Sophos Email Security and Control - Free 30 Day Trial

Proactively block inbound and outbound threats with unrivaled effectiveness and simplicity, delivering high-capacity, hi

Publisher: Sophos

What is the (Real) Threat and How to Deal With It? A Route to Security as a Service

What is the (Real) Threat and How to Deal With It? A Route to Security as a Service

This paper looks at what IT Security means to your company and how services can assist in the battle against the threats

Publisher: IBM

Demystifying Web 2.0: Opportunities, Threats, Defenses

Demystifying Web 2.0: Opportunities, Threats, Defenses

Every new technology introduced into the enterprise brings with it new threats. Web 2.0 is no different, with threats in

Publisher: Clearswift  |  Tags: downtime, social networking, spyware

Georgia Institute of Technology White Papers

Bandwidth Estimation: Metrics, Measurement Techniques, and Tools

Bandwidth Estimation: Metrics, Measurement Techniques, and Tools

In a packet network, the terms "Bandwidth" or "Throughput" often characterize the amount of data that the network can tr

Publisher: Georgia Institute of Technology  |  Tags: data, ip, network, open source, peer-to-peer

Scalability of Network-Failure Resilience

Scalability of Network-Failure Resilience

This work quantifies scalability of network resilience upon failures. It characterize resilience as the percentage of lo

Publisher: Georgia Institute of Technology  |  Tags: network

Bandwidth Estimation and Robust Video Streaming Over 802.11e Wireless LANs

Bandwidth Estimation and Robust Video Streaming Over 802.11e Wireless LANs

Streaming high quality Audio/Video (AV) from home media sources to TV sets over a Wireless Local Area Network (WLAN) is

Publisher: Georgia Institute of Technology  |  Tags: qos, tv

Improving the Performance of TCP Wireless Video Streaming With a Novel Playback Adaptation Algorithm

Improving the Performance of TCP Wireless Video Streaming With a Novel Playback Adaptation Algorithm

This paper proposes a playback adaptation algorithm for video streaming with TCP in wireless networks where both handoff

Publisher: Georgia Institute of Technology  |  Tags: ip, wireless networks

A Cooperative Intrusion Detection System for Ad Hoc Networks

A Cooperative Intrusion Detection System for Ad Hoc Networks

Mobile Ad hoc NETworking (MANET) has become an exciting and important technology in recent years because of the rapid pr

Publisher: Georgia Institute of Technology  |  Tags: management, network