Data Science community

Statistics, we have a problem 3201 retweets

Recently, while browsing Twitter, I saw a few machine learning researchers post about an incident at one of their big conferences (NIPS) in which a band performing at the closing party made jokes…

deeplearningai: Announcing new Deep Learning courses on Coursera 2630 retweets

I have been working on three new AI projects, and am thrilled to announce the first one: deeplearning.ai, a project dedicated to disseminating AI knowledge, is launching a new sequence of Deep…

We've created a bot which beats the world's top professionals at 1v1 m... 2431 retweets

We've created a bot which beats the world's top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by self-play, and does not use imitation learning or tree search. ...

Fundamentals of Data Visualization 2338 retweets

A guide to making visualizations that accurately reflect the data, tell a story, and look professional.

Deep Learning Specialization 2044 retweets

Learn Deep Learning from deeplearning.ai. If you want to break into AI, this Specialization will help you do so. Deep Learning is one of the most highly sought after skills in tech. We will help you become good at Deep Learning. In five courses, ...

What Artificial Intelligence Can and Can’t Do Right Now 1984 retweets

What AI can and cannot do for you today. My Harvard Business Review piece on how AI affects your org.

Opening a new chapter of my work in AI 1905 retweets

I will be resigning from Baidu, where I have been leading the company’s AI Group. Baidu’s AI is incredibly strong, and the team is stacked up and down with talent; I am confident AI at Baidu will…

Unsupervised Sentiment Neuron 1895 retweets

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.

Machine Learning Yearning 1367 retweets

Get a free draft copy of my book on how to structure Machine Learning projects: I’d started this before but got distracted building Deep Learning Specialization; I’m now rebooting this. Sign up to get free chapters as they’re released!

Self-driving cars are here – Andrew Ng – Medium 1324 retweets

Drive.ai will offer a self-driving car service for public use in Frisco, Texas starting in July, 2018. Self-driving cars are no longer a futuristic AI technology. They’re here, and will soon make…

Software 2.0 – Andrej Karpathy – Medium 1154 retweets

I sometimes see people refer to neural networks as just “another tool in your machine learning toolbox”. They have some pros and cons, they work here or there, and sometimes you can use them to win…

Evolution Strategies as a Scalable Alternative to Reinforcement Learni... 1131 retweets

We've discovered that evolution strategies (ES), an optimization technique that's been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks, while overcoming many of RL's inconveniences.

AI expert: Worry more about jobs than killer robots 1038 retweets

While there has been a lot of talk about super-smart artificial intelligence lately, one of the leaders in the field thinks there are more pressing problems for humanity to solve. Andrew Ng, the cofounder of Coursera and former chief scientist at Chi...

Guess what? Study shows that self-driving cars are better at detecting... 1033 retweets

Guess what? Study shows that self-driving cars are better at detecting pedestrians with lighter skin tones. Translation: Pedestrian deaths by self-driving cars are already here - but they're not evenly distributed.

Revitalizing manufacturing through AI 986 retweets

I am excited to announce Landing.ai, a new Artificial Intelligence company that will help other enterprises transform for the age of AI. We will initially focus on the manufacturing industry. AI is…

Anatomy of an AI System 973 retweets

Thrilled to launch a big project today: ANATOMY OF AN AI SYSTEM. It's a large map & long-form essay about Amazon's Echo, and the full stack of capital, labor, and natural resources used in AI. It's a collab with TheCreaturesLab, who is a visual ...

Data science is different now · Vicki Boykis 930 retweets

New blog post: For the past couple years, I've been telling people who ask me for advice not to go into data science. Here's why: The data science job market is way oversaturated. Here's what they should do instead.

Seedbank — discover machine learning examples 920 retweets

Today we’re launching Seedbank, a place to discover interactive ML examples which you can run from your browser, no set-up required. Each example can be edited, extended, and adapted into your own project. Read mtyka's post for more info ↓

Nonlinear Computation in Deep Linear Networks 859 retweets

We've shown that deep linear networks — as implemented using floating-point arithmetic — are not actually linear and can perform nonlinear computation. We used evolution strategies to find parameters in linear networks that exploit this trait, letti...

AI Transformation Playbook How to lead your company into the AI era 835 retweets

Introducing the AI Transformation Playbook! How can your company become good at AI? This 5-step Playbook provides the key steps for transforming your company using AI, from executing pilot projects to building a team, and more. Download your free co...

AI For Everyone | Coursera 805 retweets

Learn AI For Everyone from deeplearning.ai. AI is not only for engineers. If you want your organization to become better at using AI, this is the course to tell everyone--especially your non-technical colleagues--to take. In this course, you ...

This is What Happens When You Teach an AI to Name Guinea Pigs 756 retweets

Using deep learning to generate guinea pig names from images. Startlingly good.

MURA Dataset: Towards Radiologist-Level Abnormality Detection in Muscu... 754 retweets

Can your AI model detect abnormalities in bone X-rays as well as a radiologist? My Stanford lab just released a new dataset, MURA. Join our deep learning competition to see how your model compares: pranavrajpurkar jeremy_irvin16 mattlungrenM...

Probability Distributions 742 retweets

A beautiful, visual statistics textbook: “Seeing Theory – A visual introduction to probability and statistics.” I shared this a year ago, when it was already very cool. But the page now got a major upgrade by the authors and is now just incredibly...

Perceptions of Probability and Numbers 719 retweets

Perceptions of Probability and Numbers. Contribute to zonination/perceptions development by creating an account on GitHub.

Cardiologist-Level Arrhythmia Detection With Convolutional Neural Netw... 702 retweets

Technical details on our Deep Learning+ECG (detecting irregular heartbeats/arrhythmia) work:

Google AI Blog: Looking Back at Google’s Research Efforts in 2018 698 retweets

On behalf of the whole Google Research & GoogleAI community, I was excited to put together a post describing some of the work that we collectively did in 2018. I hope you enjoy it! Thanks to everyone who helped make this work possible!

Cardiologist-level arrhythmia detection and classification in ambulato... 681 retweets

Our new paper in Nature Medicine! Cardiologist-level arrhythmia detection from ECG, using deep learning. I'm very optimistic about such technology helping patients. awnihannun, pranavrajpurkar, leftbundle, GeoffTison iRhythmTech

Improving Palliative Care with Deep Learning 669 retweets

Improving the quality of end-of-life care for hospitalized patients is a priority for healthcare organizations. Studies have shown that physicians tend to over-estimate prognoses, which in combination with treatment inertia results in a mismatch betw...

Imperial launches one of world's first online Masters in Machine Learn... 669 retweets

Imperial College imperialcollege is launching an online Masters degree in Machine Learning with coursera. This is exciting! Given the huge global demand for machine learning engineers, the world needs a lot more degree programs in AI and ML.

CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with D... 658 retweets

We develop an algorithm that can detect pneumonia from chest X-rays at a level exceeding practicing radiologists. Our algorithm, CheXNet, is a 121-layer convolutional neural network trained on ChestX-ray14, currently the largest publicly available ch...

Visual Attribute Transfer through Deep Image Analogy 655 retweets

We propose a new technique for visual attribute transfer across images that may have very different appearance but have perceptually similar semantic structure. By visual attribute transfer, we mean transfer of visual information (such as color, tone...

Learning Dexterity 617 retweets

A real robot hand, trained with the same learning algorithm and code as OpenAI Five, has learned human-like motions to rotate objects:

A Code of Ethics for Data Science 609 retweets

2.5 quintillion bytes of data are created every day. It’s created by you when you commute to work or school, when you’re shopping, when you get a medical treatment, and even when you’re sleeping…

The implausibility of intelligence explosion 592 retweets

In 1965, I. J. Good described for the first time the notion of “intelligence explosion”, as it relates to artificial intelligence (AI): Decades later, the concept of an “intelligence explosion” —…

Nevergrad: An open source tool for derivative-free optimization 589 retweets

We are open-sourcing Nevergrad, a Python3 library that makes it easier to perform gradient-free optimizations used in many machine learning tasks.

Readings in applied data science 580 retweets

Next week I'll be offering a course at Stanford called "Readings in applied data science" — the complete syllabus is now available at Thanks to everyone on twitter who has helped me find good readings!

DSC Data Science Search Engine 578 retweets

Nearly 300 Statistical Concepts Explained in Simple English — in 10 Parts: abdsc BigData DataScience DataLiteracy StatisticalLiteracy Statistics MachineLearning

The AI Hierarchy of Needs – Hacker Noon 554 retweets

As is usually the case with fast-advancing technologies, AI has inspired massive FOMO, FUD and feuds. Some of it is deserved, some of it not, but the industry is paying attention. From stealth…

From Research to Production 542 retweets

An open source deep learning platform that provides a seamless path from research prototyping to production deployment.

Pearson Tested 'Social-Psychological' Messages in Learning Software, W... 510 retweets

Ed tech company experiments on 9000 kids without anyone's consent or knowledge to see if they test differently when 'social-psychological' messaging is secretly inserted? HARD NO.

Will New Machine Learning Benchmark Help Propel AI Forward? 506 retweets

Let the AI benchmarking wars begin. Today, a diverse group from academia and industry – Google, Baidu, Intel, AMD, Harvard, and Stanford among them –

Bias detectives: the researchers striving to make algorithms fair 502 retweets

Nature just published a major feature on researchers working on bias in machine learning. Features many of us, including geomblog mikarv s010n achould b_mittelstadt rayidghani dgrobinson bowlinearl - and the AINowInstitute work on due proces...

A TensorFlow implementation of the Differentiable Neural Computer 501 retweets

We've open sourced the Differentiable Neural Computer! Built with Sonnet and TensorFlow. Available here:

interactive tutorials on machine learning, deep learning, R, and data ... 501 retweets

Excited to launch Kaggle Learn - interactive tutorials on machine learning, deep learning, R, and data visualization

Data-driven Advice for Applying Machine Learning to Bioinformatics Pr... 501 retweets

As the bioinformatics field grows, it must keep pace not only with new data but with new algorithms. Here we contribute a thorough analysis of 13 state-of-the-art, commonly used machine learning algorithms on a set of 165 publicly available classific...

Adversarial Examples that Fool both Computer Vision and Time-Limited ... 492 retweets

Machine learning models are vulnerable to adversarial examples: small changes to images can cause computer vision models to make mistakes such as identifying a school bus as an ostrich. However, it is still an open question whether humans are prone t...

Andrew Ng: How to Choose Your First AI Project 488 retweets

If your company wants to ramp up in AI, how should you choose your first few projects? I just wrote a Harvard Business Review HarvardBiz article with practical suggestions. This fleshes out the first step from the AI Transformation Playbook.

New UN deal with data mining firm Palantir raises protection concerns 467 retweets

The UN has made a deal with Palantir to give them highly sensitive data about aid recipients in the World Food Program - part of their “very aggressive digital transformation journey.” World reacts in horror. 😱

Today's the day!!! Watch OpenAI Five Finals here, starting 11:30a PT: ... 465 retweets

Today's the day!!! Watch OpenAI Five Finals here, starting 11:30a PT: This event will be the first time an AI has attempted to play the world champions in an esports game — and that won't be all we show today.

The combo of sexism and AI hype on is awful "She's quirky, but will n 461 retweets

The combo of sexism and AI hype on is awful. "She's quirky, but will never ghost you" ... it's a computer program.

Revisiting Small Batch Training for Deep Neural Networks 449 retweets

Modern deep neural network training is typically based on mini-batch stochastic gradient optimization. While the use of large mini-batches increases the available computational parallelism, small batch training has been shown to provide improved gene...

What's the difference between data science, machine learning, and arti... 449 retweets

When I introduce myself as a data scientist, I often get questions like “What’s the difference between that and machine learning?” or “Does that mean you work on artificial intelligence?” I’ve responded enough times that my answer easily qualifies fo...

Nova Ng – Andrew Ng – Medium 446 retweets

We’re enjoying this precious time with our first child. As probably every parent understands, we’ve also been reflecting on the world Nova will grow up in. Specifically, the long term impact of AI…

Constructed Career Paths from Job Switching Data 437 retweets

Shifting from one occupation to another can take a swing in the career path. Given your current job, what paths could you take? Here are some constructed possibilities.

CheXpert: A Large Dataset of Chest X-Rays and Competition for Automate... 432 retweets

Announcing CheXpert! Large dataset of chest X-rays co-released with MIT's MIMIC-CXR dataset. Join our competition to test your chest X-ray interpretation model: jeremy_irvin16 pranavrajpurkar mlko53 curtlanglotz mattlungrenMD Stanford

The Python Graph Gallery 432 retweets

The Python Graph Gallery: Useful for discovering and learning how to code dataviz in Python.

A Short List of Books for Doing New Things 421 retweets

Andrew Ng, the Chief Scientist at Baidu Research, has done some of the most interesting new things of the last decade. He recommends reading the following.

Seeing Theory - Regression Analysis 417 retweets

Wonderfully interactive, gentle, & well done introduction to probability and statistics. Walk through this with your favorite kid & give them a head-start in life on ML.

students did not benefit from studying according to their supposed lea... 415 retweets

We think we learn better according to our so-called learning style: visual, auditory, and kinesthetic. But the overwhelming evidence is that we don't. We all learn through listening, reading, and doing.

Hidden Technical Debt in Machine Learning Systems 412 retweets

Hidden Technical Debt in Machine Learning Systems on some of the new joys and struggles of deploying machine learning models in the wild. Still a long way to go to establish new language and design patterns for programming the 2.0 stack

Large Collection of Neural Nets, Numpy, Pandas, Matplotlib, Scikit and... 400 retweets

Large Collection of DataScience and MachineLearning Cheat Sheets for DataScientists, including Python, R, NeuralNetworks, Numpy, Pandas, and more: abdsc BigData DeepLearning AI Coding Rstats DataWrangling Statistics Probability

MIT Deep Learning Basics: Introduction and Overview with TensorFlow 397 retweets

Check out the new and first blog post by lexfridman from MIT on Deep Learning Basics with TensorFlow, as part of the MIT Deep Learning series of courses, lectures, and tutorials. Read the post here ↓

My Neural Network isn't working! What should I do? 390 retweets

A list of common mistakes made by newcomers to neural networks. DeepLearning MachineLearning DataScience

Christmas Carols, generated by a neural network 390 retweets

Neural networks are a type of computer program that imitate the way that brains learn to solve problems. They’re used for face recognition, self-driving cars, language translation, financial decisions, and more. I mainly use them to write humor. My...

An Empirical Evaluation of Generic Convolutional and Recurrent Networ... 387 retweets

For most deep learning practitioners, sequence modeling is synonymous with recurrent networks. Yet recent results indicate that convolutional architectures can outperform recurrent networks on tasks such as audio synthesis and machine translation. Gi...

Fooling Neural Networks in the Physical World with 3D Adversarial Obje... 386 retweets

Great demo: Fooling neural networks in the physical world with 3D adversarial objects. MachineLearning This is a serious security problem for DeepLearning.

Coursera, purveyor of MOOCs, bets big on university degrees 383 retweets

Get a Masters Degree through Coursera! I'm excited about all the great universities now offering online degrees. It's not just about lower costs; even more important is the convenience+flexibility+global access.

Ask the Question, Visualize the Answer 381 retweets

Let’s work through a practical example to see how asking and answering questions helps guide you towards more focused data graphics.

Converter tools for Core ML 374 retweets

Apple's CoreML tools are now on GitHub & open to contributions! Run models from Keras/sklearn/XGBoost/LibSVM on iOS