Open||Source||Data podcast

The Ethical Path to High-Quality AI Data with Fabiana Clemente

0:00
50:12
Recuar 15 segundos
Avançar 15 segundos

Timestamps

(00:02:29) Fabiana's journey starting YData and becoming a public speaker 

(00:20:19)  Misconceptions and hype around generative AI and AGI 

(00:32:46) Potential real-world impact and use cases of LLMs today 

(00:34:55) The role of synthetic data in making AI models more robust and fair 

(00:43:55) Advice for founders: value your time and learn to say no 

(00:48:24) The importance of technical leaders being able to communicate well 
 

Quotes

Charna Parkey:

 "It's a balance. I think that's also what led us to some of the demographic based data science. Essentially, folks were making like event data into pre-aggregated data. And then they were trying to obscure it so much that you couldn't get back to the person. And so you're like, okay, what's their age and what's their gender? And you're like, that's not actually the most useful part of data science that can't predict behavior or intent or any of that. It throws out time as a component of the entire process, seasonality, everything. And so there just, there has to be a better way."

Fabiana Clemente:

 "I have to say, that's a very beautiful way to put it. Hallucinations, I have to say. I never thought about that. And it makes a lot of sense. I do think, though, that in terms of LLMs, it's so language, it's so definitely, it sounds like we are getting very, very intelligent system, exactly, because language is very complex. And we know that was needed for the leap of humanity. I do think there are other, the sense of combining. Well, and here we enter in the multimodal kind of space. It's what's missing."

 

Links

Connect with Charna
Connect with Fabiana

 

Mais episódios de "Open||Source||Data"