The data life cycle consists of six stages:

  • Plan
    Decide what kind of data is needed, how it will be managed, and who will be responsible for it.
  • Capture
    Collect data or bring it into the organization from a variety of sources.
  • Manage
    Maintain and care for the data, including determining how and where it is stored and which tools are used to manage it.
  • Analyze
    Use the data to solve problems, support decision-making, and achieve business goals.
  • Archive
    Store relevant data for long-term use or future reference.
  • Destroy
    Remove data from storage and delete all shared copies.

Important Note

The six stages of the data life cycle (plan, capture, manage, analyze, archive, destroy) should not be confused with the six phases of the data analysis process (ask, prepare, process, analyze, share, act).

These two frameworks serve different purposes and are not interchangeable.


Variations of the Data Life Cycle Across Organizations

The data life cycle provides a general framework for how data is managed. However, depending on organizational goals, industry, or sector, the data life cycle may be defined differently. Below are examples of how various institutions approach the data life cycle.


U.S. Fish and Wildlife Service

The U.S. Fish and Wildlife Service uses the following data life cycle:

  • Plan
  • Acquire
  • Maintain
  • Access
  • Evaluate
  • Archive

U.S. Geological Survey (USGS)

The U.S. Geological Survey uses this data life cycle:

  • Plan
  • Acquire
  • Process
  • Analyze
  • Preserve
  • Publish / Share

Across all stages, the following cross-cutting activities are performed:

  • Describe (metadata and documentation)
  • Manage quality
  • Backup and secure

Financial Institutions

Financial institutions may follow a data life cycle such as the one described in Strategic Finance magazine:

  • Capture
  • Qualify
  • Transform
  • Utilize
  • Report
  • Archive
  • Purge

Harvard Business School (HBS)

A data life cycle informed by Harvard University research consists of eight stages:

  • Generation
  • Collection
  • Processing
  • Storage
  • Management
  • Analysis
  • Visualization
  • Interpretation

Key Takeaways

Understanding the importance of the data life cycle is essential for success as a data analyst. Individual stages may vary by organization, industry, or sector.

  • The U.S. Fish and Wildlife Service and the USGS place strong emphasis on archiving and backing up historical data.
  • Harvard Business School focuses on research and teaching, which is why its data life cycle includes visualization and interpretation, even though these stages are often associated with a data analysis process rather than a data life cycle. The HBS model does not explicitly include a data purge or destruction stage.
  • In contrast, financial institutions clearly distinguish between archiving and purging data.

In summary, while data life cycles differ across organizations, one data management principle is universal:
Data must be governed so that it is accurate, secure, and available to meet an organization’s needs.