The document discusses approaches for computing unique visitors to web pages over time ranges while dealing with changing user ID mappings.
Initially, three approaches are presented using a key-value store: storing user IDs in sets indexed by URL and hour bucket (Approach 1), using HyperLogLogs for more efficient storage (Approach 2), and storing at multiple granularities to reduce lookups (Approach 3).
The problem is made harder by the presence of "equiv" records that map one user ID to another. Later approaches try to incrementally normalize user IDs, sample user IDs, or maintain separate indexes.
Ultimately, a hybrid approach is proposed using batch computation over the entire dataset to build robust indexes,