Slides of 10+ excellent tutorials at KDD 2015: Spark, graph mining and many more

posted Aug 17, 2015, 4:01 AM by Yanchang Zhao   [ updated Aug 17, 2015, 4:02 AM ]

See slides of 10+ excellent tutorials at KDD 2015 at, incl.
  • VC-Dimension and Rademacher Averages: From Statistical Learning Theory to Sampling Algorithms
  • Graph-Based User Behavior Modeling: From Prediction to Fraud Detection
  • A New Look at the System, Algorithm and Theory Foundations of Large-Scale Distributed Machine Learning
  • Dense subgraph discovery (DSD)
  • Automatic Entity Recognition and Typing from Massive Text Corpora: A Phrase and Network Mining Approach
  • Big Data Analytics: Optimization and Randomization
  • Big Data Analytics: Social Media Anomaly Detection: Challenges and Solutions
  • Diffusion in Social and Information Networks: Problems, Models and Machine Learning Methods
  • Medical Mining
  • Large Scale Distributed Data Science using Apache Spark
  • Data-Driven Product Innovation
  • Web Personalization and Recommender Systems a mirror site for Chinese users

posted Aug 5, 2015, 5:30 AM by Yanchang Zhao   [ updated Aug 5, 2015, 5:31 AM ] now has a mirror website at Users in China can download RDataMining documents, code and data at above mirror site, if no access to

Note that will still be the primary site and please visit only when you have no access to the primary site.

Please let me know if you have access to neither of two sites below. Thanks.

CIKM Machine Learning Competition 2015

posted Jul 28, 2015, 12:29 PM by Yanchang Zhao   [ updated Jul 28, 2015, 12:29 PM ]

The CIKM Machine Learning Competition 2015 is centered around the AFL. Participants are required to predict the outcomes of every match in the 2015 AFL season in two phases:

- the Leaderboard phase, where contestants predict the outcome of each regular-season match in the 2015 AFL season. The corresponding leaderboard will be updated as the season progresses. This phase will be based on an honour system since the results of matches will already be known.

- the Finals phase, where contestants predict the outcome of each match in the 2015 AFL Finals Series. Submissions will close prior to the commencement of the first finals series match. The final leaderboard of the competition will be determined from these matches and a competition winner will be annonced after the 2015 AFL Grand Final.

The winner of the competition will be awarded $5,000 (AUD) and will be required to provide a satisfactory description of their approach.

Competition opens: 24 July 2015
Submissions close: 10 Sept 2015


The 2015 Big Data Summit, 9-10 August 2015, collocated with ACM KDD 2015, Sydney

posted Jul 14, 2015, 1:31 PM by Yanchang Zhao

The 2015 Big Data Summit
9-10 August 2015
collocated with ACM KDD 2015, Sydney

We take this privilege opportunity to invite you to
participate in the 2015 Big Data Summit:
• Co-located with ACM KDD2015
• Plenary sessions and keynote speeches by world
industrial and academic leaders
• Big data best practices and highlights in Australia
and New Zealand
• “Big Data in China” Forum
• “Data Science in India” Forum
• “Big Data in Asia” Panel

The theme of this year’s Big Data Summit is “Data to

Since the Summit’s inception in 2012, we have seen
increasing interest and investment within both
industry and government in data-led innovation and
industralisation to deeply explore big data universe,
invent data science, train data engineers and scientists,
and develop the data economy.

This year’s event aims to provide analytics professionals
and academia with a global and regional perspective to
outline the big data research, education and development
in the Asia Pacific region, showcase best practices,
explore thought-provoking insights, and demonstrate
solutions and lessons learned across industry,
government and academia.

Who Should Attend?
• Data modellers and business analysts
• Analytics professionals
• Business decision makers
• Policy executives
• Senior Government Representatives
• Academics (including research students)

What Are the Trends and Topics?
• The Data2Economy Agenda: Challenges, Trends and
• Latest Scientific Development in Data and Analytics
• The progress and future of big data in Australia and
New Zealand
• The progress and future of big data in China
• The progress and future of data science in India
• Future of Data Science and Analytics Science
• Data Economy and Industrial Transformation
• Competency, Policies and Processes
• Data Analytics Case Studies and Showcases

Why You Should Attend?
Started in 2012, the 2013 and 2014 Big Data Summit
(Sydney and Canberra) attracted over 250-300
participants from industry, government and academia.
This annual Australian Summit provides a premier and
unique forum for bridging the gaps between academia,
industry and government, and independent insights on the
advancement, best practices, trends and controversies
about data science, big data and data economy.

With very prestigious speakers across academic, industry
and government from China, India, Australia, and USA
and Europe, the 2015 Summit will cover a broad spectrum
of big data and analytics aspects and domains. The three
regional Forums organized by India, China and ANZ will
present first-hand view about progress and opportunities
in the Asia Pacific region. The “Big Data in Asia” Panel
will feature world leaders from both the Asia Pacific and
global communities, to draw a big picture of big data
innovation, services, education and economy.

Co-located with ACM KDD2015 in Hilton Sydney, BDS2015
attracts global interest, will mark a unique and high
quality opportunity for you and your organization to grasp
the cutting-edge and thought-leading progress, network
with peers and thought leaders, and most importantly,
dig out more insights and value from your big data and
lift your competency in the increasingly competitive and
challenging market and environment.

For More Information
For more details about the Summit, please visit the Website

Registration to BDS2015 will be free of charge, please Check
and Register via

For any other inquiries about the Summit, please feel free to
Contact us(

Workshop on R and Data Mining at Deakin University

posted May 4, 2015, 7:45 AM by Yanchang Zhao

I will run a workshop on R and Data Mining for students in the Master of Business Analytics course at Deakin University in Melbourne on Thursday 28 May. See workshop slides at Training.

AusDM 2015 submission deadline extended to Thursday 30 April

posted Apr 20, 2015, 1:15 PM by Yanchang Zhao

Submission deadline of the 13th Australasian Data Mining Conference (AusDM 2015) has been extended to Thursday 30 April. See details at

IAPA seminar: Big Data in the Social Sciences, Canberra, 29 April

posted Apr 20, 2015, 11:45 AM by Yanchang Zhao

Speaker: A/Prof. Robert Ackland, ANU
Date: Wednesday 29 April
Time: 5.30pm for a 6pm start
Cost: Nil
Where: SAS Offices, 12 Moore Street, Canberra, ACT 2600

RDataMining Group has 10,000 members today

posted Feb 11, 2015, 1:13 PM by Yanchang Zhao

RDataMining Group on LinkedIn has 10,000 members today! Join us for knowledge & experience sharing on R & Data Mining

UIUC and Stanford free online courses on data mining

posted Feb 8, 2015, 6:05 AM by Yanchang Zhao

See a couple of free online courses on data mining provided by UIUC and Standford University at Course & Seminar News.

Seminar: Text Analytics -- Natural Language into Big Data, Canberra, Tuesday 17 February

posted Feb 3, 2015, 3:48 AM by Yanchang Zhao

Topic: Text Analytics: Natural Language into Big Data

Speaker: Dr. Leif Hanlen, Technology Director at NICTA

Date: Tuesday 17 February

Time: 5.30pm for a 6pm start

Cost: Nil

Where: SAS Offices, 12 Moore Street, Canberra, ACT 2600

Registration URL: 

1-10 of 74