Seeing is believing!

Before you order, simply sign up for a free user account and in seconds you'll be experiencing the best in CFA exam preparation.

Subject 1. Steps in Executing a Data Analysis Project PDF Download
Big data is defined by the 4Vs:

  • Volume: huge amount of data.
  • Variety: the array of available data sources.
  • Velocity: the high speed of accumulation of data.
  • Veracity: the credibility and reliability of different data sources.

The main steps for traditional ML model building are:

  • conceptualization of the problem: state the problem, define objectives, identify useful data points, and conceptualize the model. It is like a blueprint.
  • data collection: search for and download the raw data from one or multiple sources.
  • data preparation and wrangling: cleansing and organizing raw data into a consolidated format.
  • data exploration
  • model training

For textual ML model building, the first four steps differ somewhat from those used in the traditional model:

  • text problem formulation
  • text curation
  • text preparation and wrangling
  • text exploration
  • model training

Note the last step is the same for both: model training.

Learning Outcome Statements

a. state and explain steps in a data analysis project;

CFA® 2022 Level II Curriculum, , Volume 1, Reading 8

User Contributed Comments 0

You need to log in first to add your comment.
I am using your study notes and I know of at least 5 other friends of mine who used it and passed the exam last Dec. Keep up your great work!


My Own Flashcard

No flashcard found. Add a private flashcard for the subject.