The Data Scientist Clarifies the Question – Dengue Data Search

Update to this blog – Since it was such an interesting question, I decided to continue my search for city level weekly dengue case data to answer the original question. This GitHub link provides my research notes to date.


The Data Lass

DataScience_LifeCycleOne of the first steps in the Data Science process is identifying what data you need to answer the question. In March 2017, I featured a series of blogs about characteristics of a data scientist. Today I want to add to that discussion by giving a case study of how clarifying questions is also a key part of the data science process.

It all began when I decided to participate in a crowdsourced Driven Data competition to predict local epidemics of dengue fever. I’m passionate about using machine learning and predictive analytics to solve some of the most challenging questions and thought this would be an excellent use of free time. I love learning new domains and data mining techniques to add improve my skills and help others at the same time.

Dengue fever is a mosquito-borne disease with 60,000 reported cases in Perú and Puerto Rico in 2016. (I…

View original post 596 more words

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.