Return to site

Mastery of Human Languages and the Next Big Thing in AI:

Interview with NLP Expert Dima Korolev

· NLP

Mastery of Human Languages and the Next Big Thing in AI:

Interview with NLP Expert Dima Korolev

broken image

MD: Can you provide some background about your upbringing?

I got my first computer at 6 or 7, and enjoyed writing programs more than playing games during my entire life. Algorithms always left me fascinated. My real career started at Google in Zurich, and then Microsoft in Bellevue. My dream has been to build startups, and that is what I am doing right now.What advice would you give to yourself after you graduated from the Moscow Engineering Physics Institute?

MD: What advice would you give to yourself after you graduated from the Moscow Engineering Physics Institute?

As long as you are not pushing yourself to some predetermined career goals, and instead trying to explore while you are still young, you will be fine. Thankfully, if you find yourself privileged enough, you do not have to make the choices that will make you more money in the immediate future, and instead you can choose something that is more fruitful and exciting. Betting on compounding returns with respect to investing in oneself generally pays off, and it is doubly so these days when it comes to building software and data products.

MD: Why did you choose to pursue research in data engineering and machine learning?

I like research and academia, but it was quickly apparent that I like building things more. While doing research you look at the numbers, and, if you get lucky, your paper gets published. As an engineer what you are working on goes out into the world and gets results. Data is a good field where you have to understand math and it is not just data engineering, it is a combination of engineering and research.

MD: Which paper or published work that you have written so far have you been most proud of, or believe is most significant? Why?

I have not published works. The big companies I was employed by like filing patents, and I have a few with my name to them.

Some of the patents we filed with my teammates are kind of interesting, but not really groundbreaking if you ask me. I knew it before starting in the industry that when you teach people how to write software and it clicks with them, it is rewarding and enjoyable. It always gives me more gratification when the colleagues of mine, or the people I am coaching get this excitement of broadening their knowledge, and I was lucky enough to be surrounded by grateful people often enough to not have the urge to do research and publish it to be peer-reviewed. In addition to this, contributing to open source projects, at least in my experience, has had higher returns, both in terms of keeping me busy with something useful, and when it comes to connecting with interesting people to later work for or with.

MD: Can you talk more about your May 2013 patent about Construction of Text Classifiers?

Basically, the idea for that patent is you do not want to show adult pages for non-adult viewers; accidental exposure is wrong. To make it happen, there is an underlying machine learning model which can tell adult content from non-adult content. So you have to prove that it is high-precision before releasing it. Before I came to Google, my job had to do with text, and semantic analysis was just emerging. During my first year at Google we literally used to build seed models semi-manually, and it usually takes a bunch of refinement steps and spans quarters. As our team grew and we got more experience, it became apparent that several manual steps are largely redundant, as relatively straightforward techniques can automate them away to a high degree. Specifically, this particular patent is about mining text-level features out of thin air, with no prior language knowledge whatsoever, by using the combination of several disparate, but otherwise not too individually strong, non-textual signals.

MD: What has been your favorite project at FriendlyData?

FriendlyData is an interesting beast. A lot of people think I am a Natural Language Processing (NLP) expert, but it is not true. When it comes to computers understanding the human language, you need to analyze sentiment, you have to use some fancy techniques like recurrent neural nets. At the same time, if you want to translate English queries into the programming language which a database can understand, with every single stage being as good as 90% accurate, the probabilities have to be multiplied, and, with complex inputs, it quickly goes way below 50%, rendering the project useless.

The error rate of the approach similar to the one taken by Google translate would be too high, while FriendlyData had to guarantee that the database requests we built are precise. And the approach we took was the opposite of modern trends. Together with another NLP expert whom I admire greatly, we came up with the means of defining the grammar and applying it quickly at the query time.

The most interesting project that emerged from this, which I view as one of the secret sauces behind our exit, was how can we adjust the grammar definition to allow for a suggestive engine. I would love to talk about this piece in more detail, but am under a non-competition agreement until the Fall of 2020.

MD: What is the next big thing for data science or machine learning? What risks lie ahead with machine learning?

My view of what is big may not be what the world thinks is big. We will make tremendous progress on small tasks. For example, you can argue that AI will have a profound impact on, or even take away, certain jobs. But I do not consider this groundbreaking, because they just seem like the natural progression for AI.

broken image

As I view the world, the next big thing might be cyber-counseling, or an AI that can connect with and help people feel better. This AI would be a system that works with one’s self-perception, as one closed loop. One of the biggest social changes is that we are more concerned about human identity these days, and your or my well-being have a lot to do with our subjective self-perception.

In today’s world, we are putting increasing emphasis on helping people feel better compared to being fed and healthy, and this largely is uncharted territory with respect to technology. I sense several new major markets emerging from this line of thought in the next decade or so.

MD: You mention in your blog post “On Ethics of Applying Machine Learning” that a good practice is to have sufficient human interaction for any problem you want to use machine learning to solve. When will ML be efficient enough such that human interaction is unnecessary?

That blog post was about how we have to be smart with technology. Systems, institutions, and groups of people can often be remarkably unwise. These groups might not realize that a lot of simple technological ideas can help them build better programs. However, you cannot use tech blindly; we need stronger human brains.

If you want to optimize something, you cannot just use technology to find a better strategy, and then throw this strategy away because it violates some seemingly-intuitive constraints introduced by us, humans, in the first place. Instead, we have to slowly factor in the human strengths until we end up with results that we are confident are better than what humans alone could come up with. The post was inspired by an idea that generally uses AI technology to come up with the best strategy for a real-life problem and then bluntly throws away half of this strategy so that what’s left is culturally acceptable. I am arguing, in that blog post, that this is a textbook-perfect counter-example, and it is exactly the way to not apply the AI to our real-life problems.

MD: Do you have any ideas for projects you can develop for coronavirus relief?

The more I read about COVID, the more I am convinced it mostly is about informational warfare these days. The virus does highlight the deficiencies of our healthcare system, but it does so even more when it comes to our informational hygiene. Dealing with information in the 21st century was supposed to be about finding the best strategies to get the right information out to people, whereas, sadly, what we are seeing is largely the opposite.

A project that could help with coronavirus relief would, I think, begin from exposing the data, diligently and openly, out there in the air, so that not only would we have the dashboards showing big red circles with the numbers of deaths next to them, but it be more of a Jupyter notebook style shared workplace.

I know I am daydreaming here, of course, but we do have all the tech available to make something like this happen. It’s only about the critical mass of people who would want to look this way. And it’s not something Bill Gates or Mark Zuckerberg could help put together, the times have changed. I hope that Peter Thiel or Elon Musk are looking in this exact direction as we speak.

My Experience With Coronavirus

Why did Coronavirus Spread so Fast?

Coronavirus and Globalization Moving Forward

Disinfecting Surfaces Against Coronavirus

Contagion Risks from Coronavirus

Coronavirus Oxygen Supplementation 101

Coronavirus: The Global Economic Impact

Home Care for Coronavirus

Coronavirus Causes Long Term Problems?

Online Coronavirus Scams Proliferate

What Is The True Coronavirus Case Fatality Rate For Young People?

How Likely Are Young People to be Hospitalized With Coronavirus?

Living On The Edge of A New Society

Coronavirus Will Test the Limits of Our Hospitals

Coronavirus Catapults Global Testing Innovation

Spain Suffers Under Coronavirus

Data, Models & Misinformation on the Coronavirus

Origins of the Coronavirus

Coronavirus Travels the Silk Road

Coronavirus Attacks Italy's Sick and Elderly

Is the New Coronavirus Drug a Cure?

What is the Mystery of Germany's Low Coronavirus Fatality Rate?

Coronavirus & the Economy

The World Will Be More Technologically Advanced After the COVID-19 Pandemic

Why has the Coronavirus Not Exploded in Japan?

Italy's Coronavirus Death Rate is Falling

Conquering The Coronavirus

Coronavirus Speeds Up Robotic Revolution

Economic Depression Will Destroy More Lives Than Coronavirus

Can Hydroxychloroquine be Used to Treat Coronavirus?

Northern Italy & Wuhan: Partners for Better or Worse

The Race for the Coronavirus Cure

How Did Taiwan Manage the Coronavirus so Well?

What is the US Coronavirus Fatality Rate?

Travel Ban Saves Airlines Billions

Coronavirus Superspreader?

Deep Learning Detects Coronavirus

Singapore's Coronavirus Patients Have a 0% Mortality Rate So Far... Why?

AI is Mapping the Coronavirus and Inferring its Possible Economic Impact

Coronavirus: Fact from Fiction

Death From Covid-19 is Not From the Coronavirus:

An Interview With NYU Langone Health Professor & Rheumatologist Dr. Gary Solomon

Coronavirus Attacks Italy's Sick and Elderly

Interview with NASA Astronaut Scott Kelly: An American Hero​

13 Questions With General David Petraeus

 

Why Choose Machine Learning Investing Over A Traditional Financial Advisor?

Interview With Home Depot Co-Founder Ken Langone

Interview with the Inventor of Amazon's Alexa

Automation and the Rebirth of American Retail

China Debuts Stealth Unmanned Combat Aerial Vehicle

Sweden's Economy Embraces AI & Automation

Austria's Automated Ai & Robotic Future Is Now

Nuclear Submarines: A 7,000 Lb Swiss Watch

Ai Can Write Its Own Computer Program

On Black Holes: Gateway to Another Dimension, or Ghosts of Stars’ Pasts?

Egypt's Artificial Intelligence Future

Supersonic Travel: The Future of Aviation

Was Our Moon Once Habitable?

The Modern Global Arms Race

NASA Seeks New Worlds

Cowboy Turned Space Surgeon

Shedding Light on Dark Matter: Using Machine Learning to Unravel Physics’ Hardest Questions

When High-Tech Meets Low-Tech Economy: Ai & the Construction Industry

Aquaponics: How Advanced Technology Grows Vegetables In The Desert

The World Cup Does Not Have a Lasting Positive Impact on Hosting Countries

Artificial Intelligence is Transforming the Forex Market

Do Machines Dream? Inside the Dreams of a Machine

Can Ai Replace Human Ski Coaches?

America’s Next Spy Plane

Faster than Sound and Undetectable by Radar

The Implications of Machine Learning on Condensed Matter Physics & Quantum Computing

Crafting Eco-Sustainability: WTC and Environmental Sustainability

Can Ai Transform Swimming?

Argentina's AI Future: Reversing a Century of Decline

Tennis & Artificial Intelligence

Kazakhstan's Ai Aspirations

Peru's Ai Future Will Drive Economic Growth

The Colombian Approach to the AI Revolution

How AI Can Explain Its Thinking

Singapore: Ai & Robotic City

Ai in New Zealand

Brazil & Artificial Intelligence​

Denmark & Ai

Can Ai Replace Human Ski Coaches?

Tennis & Artificial Intelligence

My Experience With Coronavirus

Why did Coronavirus Spread so Fast?

Coronavirus and Globalization Moving Forward

Disinfecting Surfaces Against Coronavirus

Contagion Risks from Coronavirus

Coronavirus Oxygen Supplementation 101

Coronavirus: The Global Economic Impact

Home Care for Coronavirus

Coronavirus Causes Long Term Problems?

Online Coronavirus Scams Proliferate

What Is The True Coronavirus Case Fatality Rate For Young People?

How Likely Are Young People to be Hospitalized With Coronavirus?

Living On The Edge of A New Society

Coronavirus Will Test the Limits of Our Hospitals

Coronavirus Catapults Global Testing Innovation

Spain Suffers Under Coronavirus

Data, Models & Misinformation on the Coronavirus

Origins of the Coronavirus

Coronavirus Travels the Silk Road

Coronavirus Attacks Italy's Sick and Elderly

Is the New Coronavirus Drug a Cure?

What is the Mystery of Germany's Low Coronavirus Fatality Rate?

Coronavirus & the Economy

The World Will Be More Technologically Advanced After the COVID-19 Pandemic

Why has the Coronavirus Not Exploded in Japan?

Italy's Coronavirus Death Rate is Falling

Conquering The Coronavirus

Coronavirus Speeds Up Robotic Revolution

Economic Depression Will Destroy More Lives Than Coronavirus

Can Hydroxychloroquine be Used to Treat Coronavirus?

Northern Italy & Wuhan: Partners for Better or Worse

The Race for the Coronavirus Cure

How Did Taiwan Manage the Coronavirus so Well?

What is the US Coronavirus Fatality Rate?

Travel Ban Saves Airlines Billions

Coronavirus Superspreader?

Deep Learning Detects Coronavirus

Singapore's Coronavirus Patients Have a 0% Mortality Rate So Far... Why?

AI is Mapping the Coronavirus and Inferring its Possible Economic Impact

Coronavirus: Fact from Fiction

Death From Covid-19 is Not From the Coronavirus:

An Interview With NYU Langone Health Professor & Rheumatologist Dr. Gary Solomon

Coronavirus Attacks Italy's Sick and Elderly

Interview with NASA Astronaut Scott Kelly: An American Hero​

13 Questions With General David Petraeus

Why Choose Machine Learning Investing Over A Traditional Financial Advisor?

Interview With Home Depot Co-Founder Ken Langone

Interview with the Inventor of Amazon's Alexa

Automation and the Rebirth of American Retail

China Debuts Stealth Unmanned Combat Aerial Vehicle

Sweden's Economy Embraces AI & Automation

Austria's Automated Ai & Robotic Future Is Now

Nuclear Submarines: A 7,000 Lb Swiss Watch

Ai Can Write Its Own Computer Program

On Black Holes: Gateway to Another Dimension, or Ghosts of Stars’ Pasts?

Egypt's Artificial Intelligence Future

Supersonic Travel: The Future of Aviation

Was Our Moon Once Habitable?

The Modern Global Arms Race

NASA Seeks New Worlds

Cowboy Turned Space Surgeon

Shedding Light on Dark Matter: Using Machine Learning to Unravel Physics’ Hardest Questions

When High-Tech Meets Low-Tech Economy: Ai & the Construction Industry

Aquaponics: How Advanced Technology Grows Vegetables In The Desert

The World Cup Does Not Have a Lasting Positive Impact on Hosting Countries

Artificial Intelligence is Transforming the Forex Market

Do Machines Dream? Inside the Dreams of a Machine

Can Ai Replace Human Ski Coaches?

America’s Next Spy Plane

Faster than Sound and Undetectable by Radar

The Implications of Machine Learning on Condensed Matter Physics & Quantum Computing

Crafting Eco-Sustainability: WTC and Environmental Sustainability

Can Ai Transform Swimming?

Argentina's AI Future: Reversing a Century of Decline

Tennis & Artificial Intelligence

Kazakhstan's Ai Aspirations

Peru's Ai Future Will Drive Economic Growth

The Colombian Approach to the AI Revolution

How AI Can Explain Its Thinking

Singapore: Ai & Robotic City

Ai in New Zealand

Brazil & Artificial Intelligence​

Denmark & Ai

Can Ai Replace Human Ski Coaches?

Tennis & Artificial Intelligence

My Experience With Coronavirus

Why did Coronavirus Spread so Fast?

Coronavirus and Globalization Moving Forward

Disinfecting Surfaces Against Coronavirus

Contagion Risks from Coronavirus

Coronavirus Oxygen Supplementation 101

Coronavirus: The Global Economic Impact

Home Care for Coronavirus

Coronavirus Causes Long Term Problems?

Online Coronavirus Scams Proliferate

What Is The True Coronavirus Case Fatality Rate For Young People?

How Likely Are Young People to be Hospitalized With Coronavirus?

Living On The Edge of A New Society

Coronavirus Will Test the Limits of Our Hospitals

Coronavirus Catapults Global Testing Innovation

Spain Suffers Under Coronavirus

Data, Models & Misinformation on the Coronavirus

Origins of the Coronavirus

Coronavirus Travels the Silk Road

Coronavirus Attacks Italy's Sick and Elderly

Is the New Coronavirus Drug a Cure?

What is the Mystery of Germany's Low Coronavirus Fatality Rate?

Coronavirus & the Economy

The World Will Be More Technologically Advanced After the COVID-19 Pandemic

Why has the Coronavirus Not Exploded in Japan?

Italy's Coronavirus Death Rate is Falling

Conquering The Coronavirus

Coronavirus Speeds Up Robotic Revolution

Economic Depression Will Destroy More Lives Than Coronavirus

Can Hydroxychloroquine be Used to Treat Coronavirus?

Northern Italy & Wuhan: Partners for Better or Worse

The Race for the Coronavirus Cure

How Did Taiwan Manage the Coronavirus so Well?

What is the US Coronavirus Fatality Rate?

Travel Ban Saves Airlines Billions

Coronavirus Superspreader?

Deep Learning Detects Coronavirus

Singapore's Coronavirus Patients Have a 0% Mortality Rate So Far... Why?

AI is Mapping the Coronavirus and Inferring its Possible Economic Impact

Coronavirus: Fact from Fiction

Death From Covid-19 is Not From the Coronavirus:

An Interview With NYU Langone Health Professor & Rheumatologist Dr. Gary Solomon

Coronavirus Attacks Italy's Sick and Elderly

Interview with NASA Astronaut Scott Kelly: An American Hero​

13 Questions With General David Petraeus

Why Choose Machine Learning Investing Over A Traditional Financial Advisor?

Interview With Home Depot Co-Founder Ken Langone

Interview with the Inventor of Amazon's Alexa

Automation and the Rebirth of American Retail

China Debuts Stealth Unmanned Combat Aerial Vehicle

Sweden's Economy Embraces AI & Automation

Austria's Automated Ai & Robotic Future Is Now

Nuclear Submarines: A 7,000 Lb Swiss Watch

Ai Can Write Its Own Computer Program

On Black Holes: Gateway to Another Dimension, or Ghosts of Stars’ Pasts?

Egypt's Artificial Intelligence Future

Supersonic Travel: The Future of Aviation

Was Our Moon Once Habitable?

The Modern Global Arms Race

NASA Seeks New Worlds

Cowboy Turned Space Surgeon

Shedding Light on Dark Matter: Using Machine Learning to Unravel Physics’ Hardest Questions

When High-Tech Meets Low-Tech Economy: Ai & the Construction Industry

Aquaponics: How Advanced Technology Grows Vegetables In The Desert

The World Cup Does Not Have a Lasting Positive Impact on Hosting Countries

Artificial Intelligence is Transforming the Forex Market

Do Machines Dream? Inside the Dreams of a Machine

Can Ai Replace Human Ski Coaches?

America’s Next Spy Plane

Faster than Sound and Undetectable by Radar

The Implications of Machine Learning on Condensed Matter Physics & Quantum Computing

Crafting Eco-Sustainability: WTC and Environmental Sustainability

Can Ai Transform Swimming?

Argentina's AI Future: Reversing a Century of Decline

Tennis & Artificial Intelligence

Kazakhstan's Ai Aspirations

Peru's Ai Future Will Drive Economic Growth

The Colombian Approach to the AI Revolution

How AI Can Explain Its Thinking

Singapore: Ai & Robotic City

Ai in New Zealand

Brazil & Artificial Intelligence​

Denmark & Ai

Can Ai Replace Human Ski Coaches?

Tennis & Artificial Intelligence

My Experience With Coronavirus

Why did Coronavirus Spread so Fast?

Coronavirus and Globalization Moving Forward

Disinfecting Surfaces Against Coronavirus

Contagion Risks from Coronavirus

Coronavirus Oxygen Supplementation 101

Coronavirus: The Global Economic Impact

Home Care for Coronavirus

Coronavirus Causes Long Term Problems?

Online Coronavirus Scams Proliferate

What Is The True Coronavirus Case Fatality Rate For Young People?

How Likely Are Young People to be Hospitalized With Coronavirus?

Living On The Edge of A New Society

Coronavirus Will Test the Limits of Our Hospitals

Coronavirus Catapults Global Testing Innovation

Spain Suffers Under Coronavirus

Data, Models & Misinformation on the Coronavirus

Origins of the Coronavirus

Coronavirus Travels the Silk Road

Coronavirus Attacks Italy's Sick and Elderly

Is the New Coronavirus Drug a Cure?

What is the Mystery of Germany's Low Coronavirus Fatality Rate?

Coronavirus & the Economy

The World Will Be More Technologically Advanced After the COVID-19 Pandemic

Why has the Coronavirus Not Exploded in Japan?

Italy's Coronavirus Death Rate is Falling

Conquering The Coronavirus

Coronavirus Speeds Up Robotic Revolution

Economic Depression Will Destroy More Lives Than Coronavirus

Can Hydroxychloroquine be Used to Treat Coronavirus?

Northern Italy & Wuhan: Partners for Better or Worse

The Race for the Coronavirus Cure

How Did Taiwan Manage the Coronavirus so Well?

What is the US Coronavirus Fatality Rate?

Travel Ban Saves Airlines Billions

Coronavirus Superspreader?

Deep Learning Detects Coronavirus

Singapore's Coronavirus Patients Have a 0% Mortality Rate So Far... Why?

AI is Mapping the Coronavirus and Inferring its Possible Economic Impact

Coronavirus: Fact from Fiction

Death From Covid-19 is Not From the Coronavirus:

An Interview With NYU Langone Health Professor & Rheumatologist Dr. Gary Solomon

Coronavirus Attacks Italy's Sick and Elderly

Interview with NASA Astronaut Scott Kelly: An American Hero​

13 Questions With General David Petraeus

Why Choose Machine Learning Investing Over A Traditional Financial Advisor?

Interview With Home Depot Co-Founder Ken Langone

Interview with the Inventor of Amazon's Alexa

Automation and the Rebirth of American Retail

China Debuts Stealth Unmanned Combat Aerial Vehicle

Sweden's Economy Embraces AI & Automation

Austria's Automated Ai & Robotic Future Is Now

Nuclear Submarines: A 7,000 Lb Swiss Watch

Ai Can Write Its Own Computer Program

On Black Holes: Gateway to Another Dimension, or Ghosts of Stars’ Pasts?

Egypt's Artificial Intelligence Future

Supersonic Travel: The Future of Aviation

Was Our Moon Once Habitable?

The Modern Global Arms Race

NASA Seeks New Worlds

Cowboy Turned Space Surgeon

Shedding Light on Dark Matter: Using Machine Learning to Unravel Physics’ Hardest Questions

When High-Tech Meets Low-Tech Economy: Ai & the Construction Industry

Aquaponics: How Advanced Technology Grows Vegetables In The Desert

The World Cup Does Not Have a Lasting Positive Impact on Hosting Countries

Artificial Intelligence is Transforming the Forex Market

Do Machines Dream? Inside the Dreams of a Machine

Can Ai Replace Human Ski Coaches?

America’s Next Spy Plane

Faster than Sound and Undetectable by Radar

The Implications of Machine Learning on Condensed Matter Physics & Quantum Computing

Crafting Eco-Sustainability: WTC and Environmental Sustainability

Can Ai Transform Swimming?

Argentina's AI Future: Reversing a Century of Decline

Tennis & Artificial Intelligence

Kazakhstan's Ai Aspirations

Peru's Ai Future Will Drive Economic Growth

The Colombian Approach to the AI Revolution

How AI Can Explain Its Thinking

Singapore: Ai & Robotic City

Ai in New Zealand

Brazil & Artificial Intelligence​

Denmark & Ai

Can Ai Replace Human Ski Coaches?

Tennis & Artificial Intelligence

Written by Michael Ding & Edited by Alexander Fleiss