Privacy Budget: Code Locations
Following on from the high level Privacy
Budget explainer, the current
implementation focuses on measuring the identifiability of various web exposed
features. Hence the word Identifiability occurs often in the code and
documentation.
This document focuses on code layout for Privacy Budget. Concepts and background for the identifiability study are out of scope.
TODO(asanka): Link to study documents once they are checked in.
Core Metrics and Aggregation
Locations:
Includes:
-
Core logic and primitives for constructing identifiability metrics.
This is what one would use when reporting identifiability study samples. Centralized logic makes it easier to construct consistent and stable samples. All logic for supporting the construction of
blink::IdentifiableTokenvalues fromstd::types andbase::types should go here. -
Per-process aggregation of metrics.
Aggregation minimizes the amount of information being communicated across process boundaries.
The code in this directory is shared across //content, //chrome, and
//third_party/blink. Hence its placement in blink/public/common.
In addition, this directory also contains logic for per-process aggregation of metrics so that they can be efficiently communicated across process boundaries.
Metrics Calculation for types visible to blink/renderer/platform
Locations:
Functions for constructing blink::IdentifiableToken values from
platform/wtf types. E.g. blink::String.
See the DEPS in that directory for the paths that this
component can depend on. In particular:
blink/renderer/platformcan't depend onmodules/orcore/which means that types from those source locations will need to be supported elsewhere.blink/renderer/platform/foocan depend on other features underplatform/. So it would be possible to add support for types inplatform/in this directory.
Metrics calculation for types used in bindings based instrumentation
Bindings based instrumentation is discussed in Annotating Direct Surfaces vis WebIDL Bindings.
The generated bindings invoke Dactyloscoper::RecordDirectSurface() overrides
for sampling and reporting. Hence support for types visible to
renderer/core/frame lives in
dactyloscoper.cc/.h
Static study settings
Locations:
Logic for accessing per-session settings based on externally supplied field
trial configurations. The full set of externally controlled settings are
in
privacy_budget_features.h.
At a high level, these settings control such things as:
- Whether the study is active.
- Which identifiable surfaces should not be sampled.
- Parameters for how surfaces are selected for sampling.
Both the browser and the renderer need to access these settings. The browser needs them for filtering and reporting. The renderer needs them to avoid sampling surfaces where sampling itself is harmful for performance or stability reasons.
Persistent study state and reporting
Locations:
Per-client state is primarily used and exposed by IdentifiabilityStudyState
(Source).