Towards Data Science podcast

98. Mike Tung - Are knowledge graphs AI’s next big thing?


As impressive as they are, language models like GPT-3 and BERT all have the same problem: they’re trained on reams of internet data to imitate human writing. And human writing is often wrong, biased, or both, which means language models are trying to emulate an imperfect target.

Language models often babble, making up answers to questions they don't understand, which makes them unreliable sources of truth. That's why there's been increased interest in alternative ways to retrieve information from large datasets — approaches that include knowledge graphs.

Knowledge graphs encode entities like people, places, and objects as nodes, which are connected to other entities via edges that specify the nature of the relationship between them. For example, a knowledge graph might contain a node for Mark Zuckerberg, linked to another node for Facebook via an edge indicating that Zuck is Facebook's CEO. Both of these nodes might in turn be connected to dozens, or even thousands, of others, depending on the scale of the graph.
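The node-and-edge idea above can be sketched in a few lines of code. This is not Diffbot's implementation — just a minimal illustration storing facts as (subject, relation, object) triples, with the Zuckerberg/Facebook example from the paragraph (the "headquartered in" fact is added purely for illustration):

```python
class KnowledgeGraph:
    """A toy knowledge graph: facts stored as (subject, relation, object) triples."""

    def __init__(self):
        self.edges = []  # each entry is a (subject, relation, object) triple

    def add(self, subject, relation, obj):
        self.edges.append((subject, relation, obj))

    def neighbors(self, node):
        """Return every (relation, other_node) pair touching `node`."""
        out = []
        for s, r, o in self.edges:
            if s == node:
                out.append((r, o))
            elif o == node:
                out.append((r, s))
        return out


kg = KnowledgeGraph()
kg.add("Mark Zuckerberg", "is CEO of", "Facebook")
kg.add("Facebook", "headquartered in", "Menlo Park")  # illustrative extra fact

print(kg.neighbors("Facebook"))
# [('is CEO of', 'Mark Zuckerberg'), ('headquartered in', 'Menlo Park')]
```

Answering a question like "who runs Facebook?" then becomes a graph lookup rather than free-form text generation, which is where the reliability advantage over language models comes from.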

Knowledge graphs are an exciting path forward for AI, and some of the world's largest are built by a company called Diffbot. Diffbot's CEO Mike Tung joined me for this episode of the podcast to discuss where knowledge graphs improve on more standard techniques, and why they might be a big part of the future of AI.

---

Intro music by:

➞ Artist: Ron Gelinas

➞ Track Title: Daybreak Chill Blend (original mix)

➞ Link to Track: https://youtu.be/d8Y2sKIgFWc

---

0:00 Intro

1:30 The Diffbot dynamic

3:40 Knowledge graphs

7:50 Crawling the internet

17:15 What makes this time special?

24:40 Relation to neural networks

29:30 Failure modes

33:40 Sense of competition

39:00 Knowledge graphs for discovery

45:00 Consensus to find truth

48:15 Wrap-up
