Knowledge Discovery from Data (TKDD)


Search Issue
enter search term and/or author name


ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 6 Issue 2, July 2012

A Sequential Sampling Framework for Spectral k-Means Based on Efficient Bootstrap Accuracy Estimations: Application to Distributed Clustering
Dimitrios Mavroeidis, Panagis Magdalinos
Article No.: 5
DOI: 10.1145/2297456.2297457

The scalability of learning algorithms has always been a central concern for data mining researchers, and nowadays, with the rapid increase in data storage capacities and availability, its importance has increased. To this end, sampling has been...

A Model for Information Growth in Collective Wisdom Processes
Sanmay Das, Malik Magdon-Ismail
Article No.: 6
DOI: 10.1145/2297456.2297458

Collaborative media such as wikis have become enormously successful venues for information creation. Articles accrue information through the asynchronous editing of users who arrive both seeking information and possibly able to contribute...

Generative Models for Evolutionary Clustering
Tianbing Xu, Zhongfei Zhang, Philip S. Yu, Bo Long
Article No.: 7
DOI: 10.1145/2297456.2297459

This article studies evolutionary clustering, a recently emerged hot topic with many important applications, noticeably in dynamic social network analysis. In this article, based on the recent literature on nonparametric Bayesian models, we have...

The Latent Maximum Entropy Principle
Shaojun Wang, Dale Schuurmans, Yunxin Zhao
Article No.: 8
DOI: 10.1145/2297456.2297460

We present an extension to Jaynes’ maximum entropy principle that incorporates latent variables. The principle of latent maximum entropy we propose is different from both Jaynes’ maximum entropy principle and maximum likelihood...

Cross-Guided Clustering: Transfer of Relevant Supervision across Tasks
Indrajit Bhattacharya, Shantanu Godbole, Sachindra Joshi, Ashish Verma
Article No.: 9
DOI: 10.1145/2297456.2297461

Lack of supervision in clustering algorithms often leads to clusters that are not useful or interesting to human reviewers. We investigate if supervision can be automatically transferred for clustering a target task, by providing a relevant...