Getting started with measurements

An accessible introduction to planning and running simple Internet measurements.

1. Define your question

Start with a clear research or operational question: what do you want to measure and why? Narrow the scope to a measurable outcome (e.g., latency differences between two regions, reachability of a service, or the distribution of response codes). Write a short plan that specifies the metrics you'll collect, the time window, and what would count as a meaningful result.

2. Choose metrics and vantage points

Decide which metrics best answer your question (round-trip time, packet loss, traceroute hops, HTTP status codes, DNS resolution times). Equally important are vantage points: will you run probes from a single host, multiple cloud regions, or volunteer devices? Ensure your sample covers the population you care about.

3. Choose tools

Pick lightweight, well-understood tools. For many tasks traceroute and ping suffice; for HTTP checks, use curl or a small Python script (requests). When you need repeatability, use a measurement platform or orchestrate probes via cron/jobs. Prefer open-source tools that are transparent and maintained.

4. Sampling strategy and rate-limiting

Choose a sampling frequency that balances signal and impact. Avoid high-frequency scanning that may resemble abuse; if probing third-party networks, rate-limit and add descriptive User-Agent strings where applicable. Log precise timestamps (UTC) and record probe identifiers so results can be traced back to runs.

5. Run small pilots

Always pilot with a tiny set of probes and review results for unexpected behaviour. Use the pilot to validate parsing code, confirm timestamps line up, and check that your probes are not causing side-effects. Document anomalies and adapt the plan before scaling up.

6. Analysis checklist

Clean and normalise timestamps and addresses.
Identify and filter obvious outliers caused by transient failures.
Visualise distributions (CDFs for latency, histograms for counts).
Report uncertainties and sample sizes alongside any claims.