Knowledge Discovery from Data (TKDD)


Search Issue
enter search term and/or author name


ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 8 Issue 4, October 2014

Probabilistic Reframing for Cost-Sensitive Regression
José Hern´ndez-Orallo
Article No.: 17
DOI: 10.1145/2641758

Common-day applications of predictive models usually involve the full use of the available contextual information. When the operating context changes, one may fine-tune the by-default (incontextual) prediction or may even abstain from predicting a...

MDL4BMF: Minimum Description Length for Boolean Matrix Factorization
Pauli Miettinen, Jilles Vreeken
Article No.: 18
DOI: 10.1145/2601437

Matrix factorizations—where a given data matrix is approximated by a product of two or more factor matrices—are powerful data mining tools. Among other tasks, matrix factorizations are often used to separate global structure from...

Feature Selection for Social Media Data
Jiliang Tang, Huan Liu
Article No.: 19
DOI: 10.1145/2629587

Feature selection is widely used in preparing high-dimensional data for effective data mining. The explosive popularity of social media produces massive and high-dimensional data at an unprecedented rate, presenting new challenges to feature...

Efficient Discovery of Association Rules and Frequent Itemsets through Sampling with Tight Performance Guarantees
Matteo Riondato, Eli Upfal
Article No.: 20
DOI: 10.1145/2629586

The tasks of extracting (top-K) Frequent Itemsets (FIs) and Association Rules (ARs) are fundamental primitives in data mining and database applications. Exact algorithms for these problems exist and are widely used, but their running time...

Discovering Social Circles in Directed Graphs
Scott H. Burton, Christophe G. Giraud-Carrier
Article No.: 21
DOI: 10.1145/2641759

We examine the problem of identifying social circles, or sets of cohesive and mutually aware nodes surrounding an initial query set, in directed graphs where the complete graph is not known beforehand. This problem differs from local community...

Random Projections for Linear Support Vector Machines
Saurabh Paul, Christos Boutsidis, Malik Magdon-Ismail, Petros Drineas
Article No.: 22
DOI: 10.1145/2641760

Let X be a data matrix of rank ρ, whose rows represent n points in d-dimensional space. The linear support vector machine constructs a hyperplane separator that maximizes the 1-norm soft margin. We develop a new oblivious...

Reconstructing Graphs from Neighborhood Data
Dóra Erdős, Rainer Gemulla, Evimaria Terzi
Article No.: 23
DOI: 10.1145/2641761

Consider a social network and suppose that we are only given the number of common friends between each pair of users. Can we reconstruct the underlying network? Similarly, consider a set of documents and the words that appear in them. If we...