Dataset Creation and Curation

Dataset Creation and Curation

Actions and Detail Panel


Date and time


Online event

Process and tools for cost-effective dataset creation - Christiaan Swart

About this event


  • Different ways to collect data
  • Organizing the process for data collection
  • Active and distant learning
  • Finding issues in data

About the speaker:

Chris Swart has 6 years of experience delivering Natural Language Processing (NLP) services across the email, complaint, pharma, and sales industries. He has an interest in cost effective dataset creation with distant supervision and building semi-supervised datasets to get the best bang for buck for models. He cofounded Comtura on a mission to help sales teams weponise their customer's voices to sell more. At Comtura he leads a machine learning team of 3.

DataTalks.Club is the place to talk about data. Join our slack community!

Share with friends