Emerging Technologies

How to learn big data for free

Bernard Marr
Share:
Our Impact
What's the World Economic Forum doing to accelerate action on Emerging Technologies?
The Big Picture
Explore and monitor how Data Science is affecting economies, industries and global issues
A hand holding a looking glass by a lake
Crowdsource Innovation
Get involved with our crowdsourced digital platform to deliver impact at scale
Stay up to date:

Data Science

There are expected to be 4.4 million big data jobs by 2015 in governments and every sector of industry. Combine this with a shortage of people trained to carry out the analysis needed (predicted to be nearly 200,000 by 2018) and depending on your point of view you have either a lot of unfilled vacancies, or a lucrative career ahead of you.

But won’t you need a degree and relevant experience? Well, possibly. Not everyone can afford to spend years going back to college and retraining, but there are alternatives.

Increasingly colleges and universities are putting courses online where they can be studied for free. You may not get a degree at the end, but that might not be important. IBM big data evangelist James Kobielus said in 2013 “academic credentials are important but not necessary for high-quality data science. The core aptitudes – curiosity, intellectual agility, statistical fluency, research stamina, scientific rigor, skeptical nature – that distinguish the best data scientists are widely distributed throughout the population.”

Some of the courses do offer certificates of completion or other forms of accreditation, which can certainly go on your CV to impress potential employers.

Of course if you’re not in the employment market – say you run your own business – then these are valuable purely for the knowledge they can give you. There’s no reason that a reasonably competent person couldn’t use that knowledge to launch their own data strategy and reap insights, whatever their business. I would love to hear in the comments section if anyone has done this.

Here’s an overview of what’s available online from various schools, colleges and universities:

The University of Washington’s Introduction to Data Science is available online at Coursera – a huge repository of online learning.
The course can be completed in 8 weeks if you put in 10 to 12 hours’ study per week, and covers the history of data science, key techniques and technologies such as MapReduce and Hadoop as well as traditional relational databases, designing experiments using statistical modeling, and visualizing results.

Some basic programming knowledge is needed, but don’t worry there are plenty of free courses where you can pick that up too, if you don’t already have it (see below).

Coursera’s courses usually run between set dates – if you want accreditation or certificates, you have to register before a set date and complete them before a final deadline. However if you are just interested in the knowledge, you can download all the course materials – which come as videos and reading material – to browse at your leisure.

Harvard also makes its Data Science course available for free online. All lectures are uploaded as videos shortly after they take place, and materials and homework assignments are uploaded to the open source knowledge repository Github.

This course covers what it calls the key facets of a big data investigation: Data wrangling, management, exploratory analysis, prediction and communication of results. Some basic Python knowledge is required.

A familiarity with the basic concepts of statistics is fundamental to big data analysis. You can learn them from Stanford’s course Statistics One, also on Coursera.

The course assumes very little background knowledge and describes itself as “a comprehensive yet friendly” introduction to the subject. It is also designed to work as a refresher for anyone who may have studied it at school or college in the past but let themselves get a bit rusty on the fundamentals!

Those looking for slightly more in-depth or specialist knowledge may be interested in Stanford’s Algorithms: Design and Analysis course. Programming knowledge is essential – you will be expected to know at least one language, i.e C, Java or Python.

The course covers the fundamental principles behind algorithmic design – design paradigms, randomized algorithms and probability, graph algorithms and data structures.

Speaking of programming, a basic level of familiarity with at least one language is recommended for anyone interested in data. Python is a good choice, as it is designed for very fast processing of very large datasets, and is widely used in big data enterprise. Codeacademy.com , Coursera.org and MIT all offer free courses in Python designed for absolute beginners with no programming experience.

If you’re interested in machine learning – the fast growing field of creating self-learning algorithms that can adapt themselves based on data with no human input – there are courses for that too.

The California Institute of Technology’s Learning from Data course includes all of the lectures uploaded to Youtube and iTunes for convenience. It’s one for those who already have some academic background in computer science and are looking to move into a field where a lot of exciting breakthroughs are being made.

Visualization is key to gaining insights from data. Graphs, charts and other far more creative techniques are employed to help us spot patterns hidden in mountains of numbers or unstructured data. UC Berkeley makes its Visualization course available for free online, which can teach you techniques and algorithms used to create effective and well-designed graphical representations of data. You will need some familiarity with one popular graphics API (such as OpenGL or GDI+) as well as one data application (Excel will do). Whichever you choose is up to you as the assignments can be submitted in any format.

Here you have it, you can learn all about Big Data for free – so no more excuses! Hope this post was useful? As always, if you know of any other free Big Data learning resources then please share them in the comments below.

If you want to read my future big data articles then please click ‘Follow‘ and send me a LinkedIn invite. And, of course, feel free to also connect via Twitter, Facebookand The Advanced Performance Institute.

To get a quick overview of big data, have a look at this video:

This article is published in collaboration with LinkedIn. Publication does not imply endorsement of views by the World Economic Forum.

To keep up with the Agenda subscribe to our weekly newsletter.

Author: Bernard Marr is Best-Selling Author, Keynote Speaker and Leading Business and Data Expert.

Image: Internet LAN cables are pictured in this photo illustration taken in Sydney June 23, 2011. REUTERS/Tim Wimborne.

Don't miss any update on this topic

Create a free account and access your personalized content collection with our latest publications and analyses.

Sign up for free

License and Republishing

World Economic Forum articles may be republished in accordance with the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License, and in accordance with our Terms of Use.

The views expressed in this article are those of the author alone and not the World Economic Forum.

Share:
World Economic Forum logo
Global Agenda

The Agenda Weekly

A weekly update of the most important issues driving the global agenda

Subscribe today

You can unsubscribe at any time using the link in our emails. For more details, review our privacy policy.

UK proposal to ban smartphones for kids, and other technology stories you need to know

Cathy Li

May 7, 2024

About Us

Events

Media

Partners & Members

  • Join Us

Language Editions

Privacy Policy & Terms of Service

© 2024 World Economic Forum