Artwork

Inhalt bereitgestellt von DataStax and Charna Parkey. Alle Podcast-Inhalte, einschließlich Episoden, Grafiken und Podcast-Beschreibungen, werden direkt von DataStax and Charna Parkey oder seinem Podcast-Plattformpartner hochgeladen und bereitgestellt. Wenn Sie glauben, dass jemand Ihr urheberrechtlich geschütztes Werk ohne Ihre Erlaubnis nutzt, können Sie dem hier beschriebenen Verfahren folgen https://de.player.fm/legal.
Player FM - Podcast-App
Gehen Sie mit der App Player FM offline!

The Ethical Path to High-Quality AI Data with Fabiana Clemente

50:12
 
Teilen
 

Manage episode 443179556 series 3604986
Inhalt bereitgestellt von DataStax and Charna Parkey. Alle Podcast-Inhalte, einschließlich Episoden, Grafiken und Podcast-Beschreibungen, werden direkt von DataStax and Charna Parkey oder seinem Podcast-Plattformpartner hochgeladen und bereitgestellt. Wenn Sie glauben, dass jemand Ihr urheberrechtlich geschütztes Werk ohne Ihre Erlaubnis nutzt, können Sie dem hier beschriebenen Verfahren folgen https://de.player.fm/legal.

How can we accelerate AI while protecting privacy? Fabiana Clemente discusses founding YData to enable high-quality synthetic data for machine learning. She covers open sourcing data profiling tools, the impact of generative AI on synthetic data, and maintaining work-life balance as an introvert leader.
Timestamps

(00:02:29) Fabiana's journey starting YData and becoming a public speaker

(00:20:19) Misconceptions and hype around generative AI and AGI

(00:32:46) Potential real-world impact and use cases of LLMs today

(00:34:55) The role of synthetic data in making AI models more robust and fair

(00:43:55) Advice for founders: value your time and learn to say no

(00:48:24) The importance of technical leaders being able to communicate well

Quotes

Charna Parkey:

"It's a balance. I think that's also what led us to some of the demographic based data science. Essentially, folks were making like event data into pre-aggregated data. And then they were trying to obscure it so much that you couldn't get back to the person. And so you're like, okay, what's their age and what's their gender? And you're like, that's not actually the most useful part of data science that can't predict behavior or intent or any of that. It throws out time as a component of the entire process, seasonality, everything. And so there just, there has to be a better way."

Fabiana Clemente:

"I have to say, that's a very beautiful way to put it. Hallucinations, I have to say. I never thought about that. And it makes a lot of sense. I do think, though, that in terms of LLMs, it's so language, it's so definitely, it sounds like we are getting very, very intelligent system, exactly, because language is very complex. And we know that was needed for the leap of humanity. I do think there are other, the sense of combining. Well, and here we enter in the multimodal kind of space. It's what's missing."

Links
Connect with Charna
Connect with Fabiana

  continue reading

103 Episoden

Artwork
iconTeilen
 
Manage episode 443179556 series 3604986
Inhalt bereitgestellt von DataStax and Charna Parkey. Alle Podcast-Inhalte, einschließlich Episoden, Grafiken und Podcast-Beschreibungen, werden direkt von DataStax and Charna Parkey oder seinem Podcast-Plattformpartner hochgeladen und bereitgestellt. Wenn Sie glauben, dass jemand Ihr urheberrechtlich geschütztes Werk ohne Ihre Erlaubnis nutzt, können Sie dem hier beschriebenen Verfahren folgen https://de.player.fm/legal.

How can we accelerate AI while protecting privacy? Fabiana Clemente discusses founding YData to enable high-quality synthetic data for machine learning. She covers open sourcing data profiling tools, the impact of generative AI on synthetic data, and maintaining work-life balance as an introvert leader.
Timestamps

(00:02:29) Fabiana's journey starting YData and becoming a public speaker

(00:20:19) Misconceptions and hype around generative AI and AGI

(00:32:46) Potential real-world impact and use cases of LLMs today

(00:34:55) The role of synthetic data in making AI models more robust and fair

(00:43:55) Advice for founders: value your time and learn to say no

(00:48:24) The importance of technical leaders being able to communicate well

Quotes

Charna Parkey:

"It's a balance. I think that's also what led us to some of the demographic based data science. Essentially, folks were making like event data into pre-aggregated data. And then they were trying to obscure it so much that you couldn't get back to the person. And so you're like, okay, what's their age and what's their gender? And you're like, that's not actually the most useful part of data science that can't predict behavior or intent or any of that. It throws out time as a component of the entire process, seasonality, everything. And so there just, there has to be a better way."

Fabiana Clemente:

"I have to say, that's a very beautiful way to put it. Hallucinations, I have to say. I never thought about that. And it makes a lot of sense. I do think, though, that in terms of LLMs, it's so language, it's so definitely, it sounds like we are getting very, very intelligent system, exactly, because language is very complex. And we know that was needed for the leap of humanity. I do think there are other, the sense of combining. Well, and here we enter in the multimodal kind of space. It's what's missing."

Links
Connect with Charna
Connect with Fabiana

  continue reading

103 Episoden

Alle Folgen

×
 
Loading …

Willkommen auf Player FM!

Player FM scannt gerade das Web nach Podcasts mit hoher Qualität, die du genießen kannst. Es ist die beste Podcast-App und funktioniert auf Android, iPhone und im Web. Melde dich an, um Abos geräteübergreifend zu synchronisieren.

 

Kurzanleitung

Hören Sie sich diese Show an, während Sie die Gegend erkunden
Abspielen