Purpose: capture and preserve digital evidence without altering it.
Digital data is information represented using discrete values - typically 0s and 1s (binary).
In forensics, understanding how data is represented helps investigators identify, recover, and interpret evidence accurately.

These physical states correspond to binary digits - 0 and 1 - forming the foundation of all digital information.
We're going through it briefly.
| Power | 10⁵ 10×10×10×10×10 |
10⁴ 10×10×10×10 |
10³ 10×10×10 |
10² 10×10 |
10¹ 10×1 |
10⁰ 1 |
|---|---|---|---|---|---|---|
| Column Value | 100,000 | 10,000 | 1,000 | 100 | 10 | 1 |
| Decimal Number | 2 | 7 | 4 | 8 | ||
| Sum of These Values (2748) | 2,000 | 700 | 40 | 8 |
(2748)₁₀ = 2×10³ + 7×10² + 4×10¹ + 8×10⁰ = 2,748
| Power | 8⁵ 8×8×8×8×8 |
8⁴ 8×8×8×8 |
8³ 8×8×8 |
8² 8×8 |
8¹ 8×1 |
8⁰ 1 |
|---|---|---|---|---|---|---|
| Column Value | 32,768 | 4,096 | 512 | 64 | 8 | 1 |
| Octal Number | 5 | 2 | 7 | 4 | ||
| Sum of These Values (5274₈) | 2,560 | 128 | 56 | 4 |
(5274)₈ = 5×8³ + 2×8² + 7×8¹ + 4×8⁰ = 2,748₁₀
| Power | 16⁵ 16×16×16×16×16 |
16⁴ 16×16×16×16 |
16³ 16×16×16 |
16² 16×16 |
16¹ 16×1 |
16⁰ 1 |
|---|---|---|---|---|---|---|
| Column Value | 1,048,576 | 65,536 | 4,096 | 256 | 16 | 1 |
| Hexadecimal Number | A | B | C | |||
| Sum of These Values (ABC₁₆) | 2,560 | 176 | 12 |
(ABC)₁₆ = A×16² + B×16¹ + C×16⁰ = 10×256 + 11×16 + 12 = 2,748₁₀
| Power | 2¹⁰ 2×2×2×2×2×2×2×2×2×2 |
2⁹ 2×2×2×2×2×2×2×2×2 |
2⁸ 2×2×2×2×2×2×2×2 |
2⁷ 2×2×2×2×2×2×2 |
2⁶ 2×2×2×2×2×2 |
2⁵ 2×2×2×2×2 |
2⁴ 2×2×2×2 |
2³ 2×2×2 |
2² 2×2 |
2¹ 2 |
2⁰ 1 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Column Value | 2048 | 1024 | 512 | 256 | 128 | 64 | 32 | 16 | 8 | 4 | 2 |
| Binary Number | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Sum of These Values (101010111100₂) | 2048 | 0 | 512 | 0 | 128 | 0 | 32 | 16 | 8 | 4 | 0 |
(101010111100)₂ = 2048 + 512 + 128 + 32 + 16 + 8 + 4 = 2,748₁₀
| Decimal | Binary | Octal | Hexadecimal |
|---|---|---|---|
| 0 | 0000 | 0 | 0 |
| 1 | 0001 | 1 | 1 |
| 2 | 0010 | 2 | 2 |
| 3 | 0011 | 3 | 3 |
| 4 | 0100 | 4 | 4 |
| 5 | 0101 | 5 | 5 |
| 6 | 0110 | 6 | 6 |
| 7 | 0111 | 7 | 7 |
| 8 | 1000 | 10 | 8 |
| 9 | 1001 | 11 | 9 |
| 10 | 1010 | 12 | A |
| 11 | 1011 | 13 | B |
| 12 | 1100 | 14 | C |
| 13 | 1101 | 15 | D |
| 14 | 1110 | 16 | E |
| 15 | 1111 | 17 | F |
Each system expresses the same values in different bases:
Binary (Base 2) · Octal (Base 8) · Decimal (Base 10) · Hexadecimal (Base 16)
| 1 Bit | Represents 0 or 1 |
| 1 Crumb | 2 bits → (00, 01, 10, 11) |
| 1 Nibble | 4 bits → (0000 - 1111) |
| 1 Byte | 8 bits |
| 1 Kilobyte (KB) | 1024 bytes |
| 1 Megabyte (MB) | 1024 kilobytes |
| 1 Gigabyte (GB) | 1024 megabytes |
| 1 Terabyte (TB) | 1024 gigabytes |
| 1 Petabyte (PB) | 1024 terabytes |
| 1 Exabyte (EB) | 1024 petabytes |
Data grows exponentially - every step up represents 1,024 times more information than the previous level.
ASCII (American Standard Code for Information Interchange)
The Extended ASCII set expands the standard 7-bit ASCII (0-127) to 8 bits (0-255).
- Supports additional symbols, graphics, and accented characters.
- Printable characters range from 32-126 and 128-255.
- Used by systems like IBM PC, Windows-1252, and ISO 8859-1 for extended language support.
| ASCII Codes - Decimal | 99 | 97 | 116 |
|---|---|---|---|
| c | a | t |
| ASCII Codes - Hexadecimal | 63 | 61 | 74 |
|---|---|---|---|
| c | a | t |
| ASCII Codes - Binary | 01100011 | 01100001 | 01110100 |
|---|---|---|---|
| c | a | t |
Each letter has a unique ASCII value, which can be expressed in decimal, hexadecimal, or binary.
Extended the number of characters. Worldwide standard for processing, displaying, and interchanging all types of language texts.
41h, Unicode: 4100hWhen searching for text data, always search in both Unicode and ASCII formats.
Base64: Encodes binary data into text for safe storage or transfer.
[A–Z][a–z][0–9][+/=]the cat sat on the mat → dGhlIGNhdCBzYXQgb24gdGhlIG1hdA==ROT13: Simple substitution cipher that rotates letters by 13 places.
A–Z a–zhello → uryybIt refers to the order in which a sequence of bytes is stored in a computer’s memory.

Many mainframe computers, particularly IBM mainframes, use a big-endian architecture.
Most modern computers, including PCs, use the little-endian system.
The more pixels an image has, the higher its resolution.
Motion = rapid display of still frames + synchronised sound.
.jpg, .doc, .pdf).
Extensionshelpusersidentify files, whileheadershelpsoftwareand forensic tools verify them.
| Extension | File Type / Description |
|---|---|
| .DOC | Microsoft Word document |
| .XLS | Microsoft Excel spreadsheet |
| .EXE | Executable program |
| .BAT | Batch command script |
| .JPG | JPEG graphic image |
| .GIF | GIF graphic image |
| .BMP | Bitmap graphic image |
| .DLL | Dynamic Link Library |
| .TXT | Plain text file |
| .ZIP | Compressed archive file |
File extensions give users a quick way to identify format and function.
Crucial for authenticating files in digital forensics.
Metadata - Digital data often includes metadata, which provides information about the characteristics, origin and structure of the data
| Extension | Header (Hex Signature) | Associated Program / Format |
|---|---|---|
| .DOC | D0 CF 11 E0 A1 B1 1A E1 00 | Microsoft Office Document |
| .EML | 46 72 6F 6D | Generic Email Message |
| .EXE | 4D 5A | Windows Executable |
| .GIF | 47 49 46 38 | Graphic Interchange Format |
| .JPG | FF D8 FF E0 | JPEG Image |
| .MOV | 73 6B 69 70 | QuickTime Movie |
| 25 50 44 46 | Adobe Acrobat File | |
| .ZIP | 50 4B 07 08 | PK Zip Archive |
The file header (magic number) reveals the true file type - even if the extension is changed.


magic number) to determine true content.Identify the actual file type.
Decide whether the file extension matches the file signature.
| File name | File signature (hex) | Actual file type (based on signature) | Extension match? |
|---|---|---|---|
photo.jpg |
FF D8 FF |
||
report.txt |
25 50 44 46 |
||
song.eni |
52 49 46 46 |
||
program.m |
4D 5A |
||
notes.docx |
50 4B 03 04 |
A hex editor (or binary/byte editor) is a program that allows manipulation of the fundamental binary data that makes up a computer file.
Useful resources for researching file headers, extensions, and magic numbers in forensic analysis.

Use case-relevant keywords only (e.g. names, usernames, suspect terms).
Avoid wasting resources on unnecessary keywords.
Searches may be case-sensitive, depending on the tool.


. |
Wildcard, matches any single character (except a full stop when escaped). |
\d |
Any digit, shorthand for [0–9]. |
+ |
Must be present one or more times. |
? |
May or may not be present (zero or one time). |
* |
Present zero, one, or multiple times. |
[abc] |
Matches one character from the set a, b, or c. |
[ea] |
Example: re[ea]d matches read and reed, but not red. |
[^abc] |
Must not match any character in the set. Example: re[^a]d matches red but not read. |
[a-z] |
Defines a range. Example: [0–9] matches any digit from 0 to 9. |
{x,y} |
Repeat the previous character between x and y times. |
A{2} |
Matches AA (repeat A exactly two times). |
a|b |
OR operator, matches a or b. |
\ |
Escape character, used to search for special characters (e.g. \. for a full stop). |
( ) |
Grouping, matches words as a unit. Example: (May) or (Jun). |



Remember – the operator only act on the preceding character

[^a-z]?Liz[^a-z]?
What does this expression find? \d{4}[- ]{3}:\d{3}
Write an expression for find email address for Roehampton
--- ## Computer Data - All computer data is fundamentally represented in **binary**. - Computers are **electronic devices**. - Electronic components represent data using **two distinct states**: - **Off / On** - **Current flowing / Not flowing** - **Switch open / Closed**