The Big Graph Mining (BGM) workshop brings together researchers and practitioners to address
various aspects of graph mining in this new era of big data, such as new
graph mining platforms, theories that drive new graph mining techniques,
scalable algorithms and visual analytics tools
that spot patterns and anomalies,
applications that
touch our daily lives, and more. Together, we explore and discuss how
these important facets of are advancing in this age of big graphs.
From Graphs to Tables: The Design of Scalable Systems for Graph Analytics
From social networks to language modeling, the growing scale and importance of graph data has driven the development of new graph-parallel systems. In this talk, I will review the graph-parallel abstraction and describe how it can be used to express important machine learning and graph analytics algorithms like PageRank and Latent factor models. I will present how systems like GraphLab and Pregel exploit restrictions in the graph-parallel abstraction along with advances in distributed graph representation to efficiently execute iterative graph algorithms orders of magnitude faster than more general data-parallel systems.
Unfortunately, the same restrictions that enable graph-parallel systems to achieve substantial performance gains also limit their ability to express many of the important stages in a typical graph-analytics pipeline. As a consequence, existing approaches to graph-analytics typically compose multiple systems through brittle and costly file interfaces. To fill the need for a holistic approach to graph-analytics we introduce GraphX, which unifies graph-parallel and data-parallel computation under a single API and system. I will show how a simple set of data-parallel operators can be used to express graph-parallel computation and how, by applying a collection of query optimizations derived from our work on graph-parallel systems, we can execute entire graph-analytics pipelines efficiently in a single distributed fault-tolerant system achieving performance comparable to specialized state-of-the-art systems.
Joseph Gonzalez is a postdoc in the AMPLab at UC Berkeley. Joseph received his PhD from the Machine Learning Department at Carnegie Mellon University where he worked with Carlos Guestrin on parallel algorithms and abstractions for scalable probabilistic machine learning. Joseph is a recipient of the AT&T Labs Graduate Fellowship and the NSF Graduate Research Fellowship.
15:10
Talk (20 min)
Active Learning with Partially Featured Data
Seungwhan Moon, Calvin McCarter, Yu-Hsin Kuo (Carnegie Mellon University)
15:30
Coffee
16:00
Talks (20 min each)
A Visual Analytic Approach for Exploring Large-Scale Document Data (Invited Talk)
Jaegul Choo (Georgia Institute of Technology)
Detecting Community Structure for Undirected Big Graphs Based on Random Walks
www.KDnuggets.com
Analytics, Big Data, Data Mining, & Data Science
Resources
Program Committee
Aapo Kyrola (Carnegie Mellon University)
Adam Perer (IBM, USA)
Aditya Prakash (Virginia Tech)
Aydin Buluc (Lawrence Berkeley National Laboratory)
Danai Koutra (Carnegie Mellon University)
Danny Bickson (GraphLab)
Deepak Rajan (LLNL)
Emmanuel Müller (Karlsruhe Institute of Technology)
Feida Zhu (Singapore Management University)
Hanghang Tong (City University of New York)
Harsha Madhyastha (University of California, Riverside)
Jaegul Choo (Georgia Tech)
Jian Pei (Simon Fraser University)
Jilles Vreeken (Max Planck Institute)
Jimeng Sun (IBM, USA)
Jingu Kim (Nokia Research Center)
Joseph Gonzalez (UC Berkeley)
Le Song (Georgia Tech)
Mladen Kolar (University of Chicago)
Nan Cao (IBM, USA)
Partha Talukdar (Carnegie Mellon University)
Philip Yu (University of Illinois at Chicago)
Rada Mihalcea (University of Michigan)
Sael Lee (Stony Brook University, Korea)
Sanjay Chawla (The University of Sydney)
Sivaraman Balakrishnan (Carnegie Mellon University)
Spiros Papadimitriou (Rutgers University)
Stephan Günnemann (Carnegie Mellon University)
Tina Eliassi-Rad (Rutgers University)
Vagelis Hristidis (UC Riverside)
Wook-Shin Han (POSTECH)
Xifeng Yan (UC Santa Babara)
Yizhou Sun (University of Illinois at Chicago)
Yucheng Low (GraphLab)
Call for Papers
Topics of interests include, but are not limited to:
Visual analytics and visualization of large graphs
Analysis of streaming/dynamic/time-evolving graphs
Machine learning on graphs
Community detection
Graph sampling
Spectral graph analysis
Social network analysis
Biological network analysis
Anomaly detection in graphs
Active learning / mining
Theoretical/complexity analysis of graph mining
Demonstrations of graph mining applications
Applications of graph mining methods on real-world problems
Important Dates
Submission
Fri, Jan 17, 2014, 23:59 Hawaii time (HST)
Notification
Tue, Feb 4
Camera-ready
Wed, Feb 12
Workshop
Mon, Apr 7
Submission Information
All papers will be peer reviewed, single-blinded.
We welcome many kinds of papers, such as (and not limited to):
Novel research papers
Demo papers
Work-in-progress papers
Visionary papers (white papers)
Authors should clearly indicate in their abstracts the kinds of
submissions that the papers belong to, to help reviewers better
understand their contributions.
Submissions must be in PDF, written in English,
no more than 6 pages long — shorter papers are welcome —
and formatted according to the standard double-column
ACM Proceedings Style (Tighter Alternate style).
For accepted papers, at least one author must attend the workshop to present the work.
Accepted papers will be included in the ACM Digital Library.
If you plan to extend your workshop paper submitted to our BGM'14 workshop,
and submit that extended work to future WWW conferences, please note the
following message from the WWW workshop co-chairs:
"Any paper published by the ACM, IEEE, etc. which can be properly cited
constitutes research which must be considered in judging the novelty of
a WWW submission, whether the published paper was in a conference,
journal, or workshop. Therefore, any paper previously published as part
of a WWW workshop must be referenced and extended with new content to
qualify as a new submission to the Research Track at the WWW conference."