AP Courses AP Biology AP Biology Units AP Human Geography AP HUG Units AP Computer Science Principles AP CSP Units
Practice Daily Practice Practice by Course Practice by Topic Practice Tests
AP Exam Resources AP Exam Dates Registration Fees Scores & Credit What to Bring
Start Practicing → Login Register →

AP Computer Science Principles · Unit 2 · Data

AP CSP Unit 2 Notes: Data

Supporting resource for AP CSP Unit 2—use with concept guides and the full practice set.

These AP CSP Unit 2 notes turn the Data outline into readable sections—bits through privacy—with links to full guides when you want worked examples and eight-question drills on each concept.

Updated May 21, 2026Reviewed by APScore5 Editorial Team

Step 1Unit 2 Guide14-step journey map Step 2ReviewBig ideas and exam weight You are hereAP CSP Unit 2 Notes: DataOpen related concept guides next Step 450 MCQsFull practice with weak-area links
Direct answer

These AP CSP Unit 2 notes condense Data vocabulary into a printable outline—binary through privacy—linked to concept guides and practice.

Binary and storage

Bits, bytes, and place values

A bit is one binary digit (0 or 1). A byte is eight bits and provides 256 distinct patterns, often shown as 0–255 unsigned. Binary place values double from the right: 1, 2, 4, 8, 16. Convert by summing places with 1 bits; convert decimal to binary by subtracting the largest fitting power of two.

Overflow occurs when a result needs more bits than allowed—255 + 1 in eight unsigned bits wraps to 0. The Ariane 5 story reminds you that real systems fail when width is ignored.

  • Bit = single switch; byte = 8 bits
  • 2ⁿ distinct values for n bits
  • ASCII: 'A' = 65 (one byte in simple English text)

Compression

Ratios, RLE, and format families

Compression shrinks files for storage and transfer. Ratio = original size : compressed size (same units). 100 MB → 25 MB is 4:1 and 75% saved—do not swap those answers.

Run-length encoding (RLE) stores count + symbol for repeats (AAAA → 4A). It helps simple graphics, not noisy photos.

  • Lossless: ZIP, PNG, FLAC — exact rebuild
  • Lossy: JPEG, MP3 — smaller, some detail discarded
  • Random data compresses poorly; repetitive data compresses well

Information and metadata

From raw values to answers

Data becomes information when processing answers a question. Filtering keeps rows that meet criteria; cleaning fixes typos, duplicates, and inconsistent formats before charts run.

Metadata describes other data—timestamps, device model, GPS in EXIF photo tags. Publishing photos without removing location metadata can expose places you never typed in the caption.

Correlation shows variables moving together; causation requires evidence that one drives the other. Summer heat can correlate ice cream sales and drownings without ice cream causing drownings.

Big data and society

Scale, models, and responsibility

Volume, velocity, and variety characterize big data. Fitness trackers illustrate velocity; mixed survey plus image archives illustrate variety.

Machine learning trains on examples. Biased or incomplete training data produces biased predictions. Sampling bias misrepresents populations; measurement bias comes from broken sensors or bad labels.

PII identifies people. Re-identification links “anonymous” rows back to individuals when fields combine narrowly. GDPR names strict EU personal-data rules you may cite on societal-impact items.

Lists and programs with data

Filtering, cleaning, and tracing

Filtering keeps rows that meet a rule; sorting only reorders. Cleaning fixes typos, duplicate rows, or inconsistent date formats before charts run. Pseudocode may loop through a list of sensor readings—each item still occupies bytes in memory.

Example: a list of daily steps [4200, 5100, 5100, 0] might need cleaning when 0 means the tracker failed, not that the student did not walk. Removing bad rows changes averages; document that choice in written responses.

Network size vocabulary

Mbps versus MB

Mbps measures megabits per second on a connection; MB measures megabytes of file size. Eight bits per byte means 8 Mbps is about 1 MB per second in ideal conditions—real networks add overhead, but AP items test whether you label units correctly.

How to use these notes with practice

Skim this page before class, then open the matching concept guide when a topic feels fuzzy. Pair with the cheat sheet for formulas and with fifty practice MCQs after you can explain each heading without looking.

Rewrite each bullet in your own words once per week. If you cannot explain overflow without reading, spend ten minutes on binary numbers before moving to compression.

Quick answers

Frequently asked questions

What topics do these AP CSP Unit 2 notes cover?

The outline covers binary numbers, bits and bytes, binary-to-decimal conversion, data compression, lossless vs lossy compression, metadata, and big data with privacy, PII, re-identification, and data bias. Use concept guides for worked examples and MCQs.

Are these Unit 2 notes enough to pass the exam?

Notes help you recall vocabulary and formulas, but AP CSP also tests scenario reasoning. Pair this page with concept guides, the eighteen-question quiz, and the fifty-question practice set so terms connect to questions.

What is the best study order with these notes?

Skim notes once, then read binary numbers, bits and bytes, and binary conversion guides. Follow with compression and metadata, then big data and privacy. Finish with flashcards, the quiz, and full practice.

How should I use notes with flashcards?

Read a notes section, then run the matching flashcard slice. If a card still feels vague, reopen the concept guide for that term. Flashcards test recall; notes give the outline structure.

Should I read notes before the Unit 2 quiz?

Yes for a quick refresh, but take the quiz closed-book after one pass. Notes prime memory; the quiz shows what still needs repair before the fifty-question set.

What binary conversion fact belongs on my notes sheet?

Write place values 1, 2, 4, 8, 16 and the rule: add powers where bits are 1. Include quick checks: 101₂ = 5 and 1010₂ = 10. That covers most AP-sized conversion items.

What compression facts should I jot down?

Compression ratio = original : compressed; lossless rebuilds exactly (ZIP, PNG for graphics); lossy trades detail for size (JPEG, MP3). Note one photo vs screenshot example beside each label.

What metadata and privacy reminders fit on notes?

Metadata is data about data (author, time, GPS). PII identifies a person; re-identification can link anonymous records back. Biased training data can produce unfair model outcomes—name a concrete example.

Can I print these AP CSP Unit 2 notes?

Yes. Use print or save as PDF; headings stay readable on letter paper. This page is for review after you learn concepts—open guides for depth and practice for application.

What should I do after finishing the notes outline?

Use the cheat sheet for a five-minute drill, take the eighteen-question quiz, then the fifty practice MCQs. Revisit any concept guide tied to missed quiz or practice tags.

50 MCQs Unit 2 Guide