ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 2 Issue 4, January 2009

Feature-preserved sampling over streaming data
Kun-Ta Chuang, Hung-Leng Chen, Ming-Syan Chen
Article No.: 15
DOI: 10.1145/1460797.1460798

In this article, we explore a novel sampling model, called feature preserved sampling (FPS), that sequentially generates a high-quality sample over sliding windows. The sampling quality we consider refers to the degree of consistency...

Mining frequent cross-graph quasi-cliques
Daxin Jiang, Jian Pei
Article No.: 16
DOI: 10.1145/1460797.1460799

Joint mining of multiple datasets can often discover interesting, novel, and reliable patterns which cannot be obtained solely from any single source. For example, in bioinformatics, jointly mining multiple gene expression datasets obtained by...

Weighted cluster ensembles: Methods and analysis
Carlotta Domeniconi, Muna Al-Razgan
Article No.: 17
DOI: 10.1145/1460797.1460800

Cluster ensembles offer a solution to challenges inherent to clustering arising from its ill-posed nature. Cluster ensembles can provide robust and stable solutions by leveraging the consensus across multiple clustering results, while averaging...

On domination game analysis for microeconomic data mining
Zhenjie Zhang, Laks V. S. Lakshmanan, Anthony K. H. Tung
Article No.: 18
DOI: 10.1145/1460797.1460801

Game theory is a powerful tool for analyzing competition among manufacturers in a market. In this article, we present a study on combining game theory and data mining by introducing the concept of domination game analysis. We present a...