enter search term and/or author name
Large-Scale Cross-Language Web Page Classification via Dual Knowledge Transfer Using Fast Nonnegative Matrix Trifactorization
Hua Wang, Feiping Nie, Heng Huang
Article No.: 1
With the rapid growth of modern technologies, Internet has reached almost every corner of the world. As a result, it becomes more and more important to manage and mine information contained in Web pages in different languages. Traditional...
Social Influence Based Clustering and Optimization over Heterogeneous Information Networks
Yang Zhou, Ling Liu
Article No.: 2
Social influence analysis has shown great potential for strategic marketing decision. It is well known that people influence one another based on both their social connections and the social activities that they have engaged in the past. In this...
Evangelos E. Papalexakis, Christos Faloutsos, Nicholas D. Sidiropoulos
Article No.: 3
How can we efficiently decompose a tensor into sparse factors, when the data do not fit in memory? Tensor decompositions have gained a steadily increasing popularity in data-mining applications; however, the current state-of-art decomposition...
Algorithms for Mining the Coevolving Relational Motifs in Dynamic Networks
Rezwan Ahmed, George Karypis
Article No.: 4
Computational methods and tools that can efficiently and effectively analyze the temporal changes in dynamic complex relational networks enable us to gain significant insights regarding the entity relations and their evolution. This article...
Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection
Ricardo J. G. B. Campello, Davoud Moulavi, Arthur Zimek, Jörg Sander
Article No.: 5
An integrated framework for density-based cluster analysis, outlier detection, and data visualization is introduced in this article. The main module consists of an algorithm to compute hierarchical estimates of the level sets of a density,...
Utility-Theoretic Ranking for Semiautomated Text Classification
Giacomo Berardi, Andrea Esuli, Fabrizio Sebastiani
Article No.: 6
Semiautomated Text Classification (SATC) may be defined as the task of ranking a set D of automatically labelled textual documents in such a way that, if a human annotator validates (i.e., inspects and corrects where appropriate) the...
During the last decade, microblog has become an important social networking service with billions of users all over the world, acting as a novel and efficient platform for the creation and dissemination of real-time information. Modeling and...
Smart Multitask Bregman Clustering and Multitask Kernel Clustering
Xianchao Zhang, Xiaotong Zhang, Han Liu
Article No.: 8
Traditional clustering algorithms deal with a single clustering task on a single dataset. However, there are many related tasks in the real world, which motivates multitask clustering. Recently some multitask clustering algorithms have been...
Measuring Temporal Patterns in Dynamic Social Networks
Wei Wei, Kathleen M. Carley
Article No.: 9
Given social networks over time, how can we measure network activities across different timesteps with a limited number of metrics? We propose two classes of dynamic metrics for assessing temporal evolution patterns of agents in terms of...
The availability of trajectories tracking the geographical locations of people as a function of time offers an opportunity to study human behaviors. In this article, we study rationality from the perspective of user decision on visiting a point of...