White Papers

R-SpamRank: A Spam Detection Algorithm Based on Link Analysis

Overview

Spam web pages use various techniques to achieve a higher ranking than they deserve. Although human experts can easily identify spam web pages, manually evaluating a large number of pages is time-consuming and costly. To assist manual evaluation, the paper proposes an algorithm that assigns spam values to web pages and semi-automatically selects potential spam pages. The authors first manually select a small set of spam pages as seeds. Then, based on the link structure of the web, the initial R-SpamRank values assigned to the seed pages propagate through links and are distributed across the whole set of web pages. After the pages are sorted by R-SpamRank value, those with high values are selected.
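The seed-and-propagate idea in the overview can be illustrated with a minimal Python sketch. This is not the paper's formula, which the overview does not give. The update rule, the `damping` value, the iteration count, and the choice to pass spam value backward (from a page to the pages that link to it) are all assumptions made for illustration, and the names `propagate_spam_scores` and `top_candidates` are hypothetical.

```python
def propagate_spam_scores(links, seeds, damping=0.85, iterations=50):
    """Spread spam value from seed pages across a link graph.

    links: dict mapping each page to the list of pages it links to.
    seeds: set of pages manually labeled as spam.
    Assumption: a page inherits spam value from the pages it links to,
    with each target's value split evenly among its in-linking pages.
    """
    pages = set(links)
    for targets in links.values():
        pages.update(targets)

    # Count how many distinct pages link to each page.
    in_degree = {page: 0 for page in pages}
    for source, targets in links.items():
        for target in set(targets):
            in_degree[target] += 1

    # Seeds start with value 1.0, every other page with 0.0.
    seed_value = {page: (1.0 if page in seeds else 0.0) for page in pages}
    scores = dict(seed_value)

    for _ in range(iterations):
        new_scores = {}
        for page in pages:
            inherited = sum(
                scores[target] / in_degree[target]
                for target in set(links.get(page, ()))
            )
            new_scores[page] = (
                (1 - damping) * seed_value[page] + damping * inherited
            )
        scores = new_scores
    return scores


def top_candidates(scores, k):
    """Return the k pages with the highest spam values for manual review."""
    return sorted(scores, key=scores.get, reverse=True)[:k]


if __name__ == "__main__":
    # Toy graph: "b" links to "a", which links to the seed page "spam".
    toy_links = {"a": ["spam"], "b": ["a"], "c": ["good"], "spam": [], "good": []}
    toy_scores = propagate_spam_scores(toy_links, seeds={"spam"})
    for page in top_candidates(toy_scores, 3):
        print(page, round(toy_scores[page], 4))
```

In this toy graph the seed ranks first, followed by the page that links to it directly and then the page two hops away, while pages with no path to the seed keep a value of zero. The ranked list is only a shortlist for human reviewers, matching the semi-automatic workflow described above.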

Publisher: Tsinghua University
File Format: PDF
Date Published: Jul 1, 2009
Format: White Papers
Topics: Spam - E-mail Fraud - Phishing, Network Security, Software Engineering

Similar White Papers

Intelligent Detection Approaches for Spam

This paper proposes intelligent detection approaches based on Incremental Support Vector Machine and Artificial Immune S

Publisher: Peking University  |  Tags: benchmark, spam

WITCH: A New Approach to Web Spam Detection

This paper presents an algorithm, WITCH, that learns to detect spam hosts or pages on the Web. Unlike most other approach

Publisher: Yahoo  |  Tags: benchmark, spam

The Dangerous Economics of Spam Control

The adoption of a wide range of regulatory and technical measures against spam has not constrained its growth and sophis

Publisher: COMDOM Software  |  Tags: developers, network, network providers, software, spam

Spam-Resilient Web Rankings Via Influence Throttling

Web search is one of the most critical applications for managing the massive amount of distributed Web content. Due to t

Publisher: Institute of Electrical and Electronics Engineers  |  Tags: applications, spam

Propagating Trust and Distrust to Demote Web Spam

Web spamming describes behavior that attempts to deceive search engines' ranking algorithms. TrustRank is a recent algor

Publisher: Lehigh University  |  Tags: data, spam

Tsinghua University White Papers

Live Video Streaming Service Over Peer to Peer Network: Design, Implementation and Experience

Providing live video streaming service over peer to peer network to a large population of end users remains challenging

Publisher: Tsinghua University  |  Tags: applications, data, network, peer to peer

User Behavior Oriented Web Spam Detection

Combating Web spam has become one of the top challenges for Web search engines. State-of-the-art spam detection techniqu

Publisher: Tsinghua University  |  Tags: data, spam

Firewall Design: Understandable, Designable and Testable

Firewalls are the cornerstones of network security. To make firewalls work effectively, firewall managers must design

Publisher: Tsinghua University  |  Tags: firewall, management, network, network security

System Design Considerations of Highly-Integrated UHF RFID Reader Transceiver RF Front-End

Nowadays the implementation of highly integrated UHF RFID reader transceiver RF front-ends with excellent performance is

Publisher: Tsinghua University  |  Tags: rfid

VICS: A Storage Virtualization Management System for SAN

Storage Area Networks (SANs) have the virtues of high scalability, high availability and high performance. On the other

Publisher: Tsinghua University  |  Tags: management, network, operating systems, sans, storage management