 doc/development/geo.md | 255 +++++++++++++++++++++++++++++++++++++++++++-----
 1 file changed, 236 insertions(+), 19 deletions(-)
diff --git a/doc/development/geo.md b/doc/development/geo.md
index 18dffe42177..9e9bd85ecd8 100644
--- a/doc/development/geo.md
+++ b/doc/development/geo.md
@@ -19,11 +19,25 @@ Geo handles replication for different components:
- [Database](#database-replication): includes the entire application, except cache and jobs.
- [Git repositories](#repository-replication): includes both projects and wikis.
-- [Uploaded blobs](#uploads-replication): includes anything from images attached on issues
+- [Blobs](#blob-replication): includes anything from images attached on issues
to raw logs and assets from CI.
With the exception of the Database replication, on a *secondary* site, everything is coordinated
-by the [Geo Log Cursor](#geo-log-cursor).
+by the [Geo Log Cursor](#geo-log-cursor-daemon).
+
+### Replication states
+
+The following diagram illustrates how replication works. Some allowed transitions are omitted for clarity.
+
+```mermaid
+stateDiagram-v2
+ Pending --> Started
+ Started --> Synced
+ Started --> Failed
+ Synced --> Pending: Mark for resync
+ Failed --> Pending: Mark for resync
+ Failed --> Started: Retry
+```
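+
+To make the state diagram concrete, here is a minimal Ruby sketch of the same
+transition rules. It is illustrative only; the actual registry classes model
+state differently.
+
+```ruby
+# Illustrative sketch of the replication state transitions above;
+# not the actual registry implementation.
+REPLICATION_TRANSITIONS = {
+  pending: [:started],
+  started: [:synced, :failed],
+  synced:  [:pending],           # mark for resync
+  failed:  [:pending, :started]  # mark for resync, or retry
+}.freeze
+
+def transition(from, to)
+  unless REPLICATION_TRANSITIONS.fetch(from, []).include?(to)
+    raise ArgumentError, "cannot move from #{from} to #{to}"
+  end
+
+  to
+end
+```
+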
### Geo Log Cursor daemon
@@ -66,7 +80,7 @@ the state of every repository in the [tracking database](#tracking-database).
A repository can be replicated by the:
- [Repository Sync worker](#repository-sync-worker).
-- [Geo Log Cursor](#geo-log-cursor).
+- [Geo Log Cursor](#geo-log-cursor-daemon).
#### Project Registry
@@ -97,26 +111,211 @@ projects that need updating. Those projects can be:
timestamp that is more recent than the `last_repository_successful_sync_at`
timestamp in the `Geo::ProjectRegistry` model.
- Manual: The administrator can manually flag a repository to resync in the
- [Geo Admin Area](../user/admin_area/geo_nodes.md).
+ [Geo Admin Area](../user/admin_area/geo_sites.md).
When fetching a repository on the secondary fails `RETRIES_BEFORE_REDOWNLOAD`
times, Geo does a so-called _re-download_: a clean clone into the
`@geo-temporary` directory in the root of the storage. When the clone
succeeds, the main repository is replaced with the newly cloned one.
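+
+Conceptually, the swap looks like the sketch below. The paths and the helper
+are hypothetical; the real implementation lives in the Geo sync services and
+differs in detail.
+
+```ruby
+# Hypothetical sketch of a re-download: clean clone first, swap afterwards.
+require 'fileutils'
+
+def redownload(repo_url, storage_root, repo_path)
+  temporary = File.join(storage_root, '@geo-temporary', repo_path)
+  FileUtils.rm_rf(temporary)
+
+  # Clone into the temporary directory first...
+  system('git', 'clone', '--mirror', repo_url, temporary) or raise 'clone failed'
+
+  # ...and replace the main repository only after the clone succeeded.
+  main = File.join(storage_root, repo_path)
+  FileUtils.rm_rf(main)
+  FileUtils.mv(temporary, main)
+end
+```
+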
-### Uploads replication
+### Blob replication
+
+Blobs such as [uploads](uploads/index.md), LFS objects, and CI job artifacts are replicated to the **secondary** site with the [Self-Service Framework](geo/framework.md). To track the state of syncing, each model has a corresponding registry table, for example `Upload` has `Geo::UploadRegistry` in the [PostgreSQL Geo Tracking Database](#tracking-database).
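+
+As a rough sketch, a registry row pairs the replicated model's ID with sync
+state, along the lines of this hypothetical migration (the real schema has
+more columns and constraints):
+
+```ruby
+# Illustrative shape of a registry table in the Geo tracking database.
+class CreateJobArtifactRegistrySketch < ActiveRecord::Migration[6.1]
+  def change
+    create_table :job_artifact_registry do |t|
+      t.bigint :artifact_id, null: false        # ID of the replicated blob's row
+      t.integer :state, default: 0, null: false # pending/started/synced/failed
+      t.integer :retry_count, default: 0
+      t.datetime :last_synced_at
+      t.text :last_sync_failure
+    end
+  end
+end
+```
+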
-File uploads are also being replicated to the **secondary** site. To
-track the state of syncing, the `Geo::UploadRegistry` model is used.
+#### Blob replication happy path workflows between services
+
+Job artifacts are used in the diagrams below as one example of a blob.
+
+##### Replicating a new job artifact
+
+Primary site:
+
+```mermaid
+sequenceDiagram
+ participant R as Runner
+ participant P as Puma
+ participant DB as PostgreSQL
+ participant SsP as Secondary site PostgreSQL
+ R->>P: Upload artifact
+ P->>DB: Insert `ci_job_artifacts` row
+ P->>DB: Insert `geo_events` row
+ P->>DB: Insert `geo_event_log` row
+ DB->>SsP: Replicate rows
+```
-#### Upload Registry
+- A [Runner](https://docs.gitlab.com/runner/) uploads an artifact
+- [Puma](architecture.md#puma) inserts a `ci_job_artifacts` row
+- Puma inserts a `geo_events` row with data like "Job Artifact with ID 123 was updated"
+- Puma inserts a `geo_event_log` row pointing to the `geo_events` row (because we built SSF on top of some legacy logic)
+- [PostgreSQL](architecture.md#postgresql) streaming replication inserts the rows in the read replica
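+
+A simplified sketch of that write pattern on the primary (the real code goes
+through replicator classes, and the column names here are abbreviated):
+
+```ruby
+# Simplified sketch: publish a Geo event for an updated job artifact.
+ApplicationRecord.transaction do
+  event = Geo::Event.create!(
+    replicable_name: 'job_artifact',
+    event_name: 'updated',
+    payload: { model_record_id: artifact.id } # `artifact` is the new upload
+  )
+
+  # Legacy wrapper row: the Geo Log Cursor tails geo_event_log.
+  Geo::EventLog.create!(geo_event_id: event.id)
+end
+```
+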
-Similar to the [Project Registry](#project-registry), there is a
-`Geo::UploadRegistry` model that tracks the synced uploads.
+Secondary site, after the PostgreSQL DB rows have been replicated:
+
+```mermaid
+sequenceDiagram
+ participant DB as PostgreSQL
+ participant GLC as Geo Log Cursor
+ participant R as Redis
+ participant S as Sidekiq
+ participant TDB as PostgreSQL Tracking DB
+ participant PP as Primary site Puma
+ GLC->>DB: Query `geo_event_log`
+ GLC->>DB: Query `geo_events`
+ GLC->>R: Enqueue `Geo::EventWorker`
+ S->>R: Pick up `Geo::EventWorker`
+ S->>TDB: Insert to `job_artifact_registry`, "starting sync"
+ S->>PP: GET <primary site internal URL>/geo/retrieve/job_artifact/123
+ S->>TDB: Update `job_artifact_registry`, "synced"
+```
+
+- [Geo Log Cursor](#geo-log-cursor-daemon) loop finds the new `geo_event_log` row
+- Geo Log Cursor processes the `geo_events` row
+ - Geo Log Cursor enqueues `Geo::EventWorker` job passing through the `geo_events` row data
+- [Sidekiq](architecture.md#sidekiq) picks up `Geo::EventWorker` job
+ - Sidekiq inserts `job_artifact_registry` row in the [PostgreSQL Geo Tracking Database](#tracking-database) because it doesn't exist, and marks it "started sync"
+ - Sidekiq does a GET request on an API endpoint at the primary Geo site and downloads the file
+ - Sidekiq marks the `job_artifact_registry` row as "synced" and "pending verification"
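+
+In outline, the sync step behaves like this sketch, where `registry`,
+`primary_url`, and `local_path` are hypothetical stand-ins rather than real
+GitLab methods:
+
+```ruby
+# Sketch of the sync step performed by `Geo::EventWorker`.
+require 'net/http'
+
+def sync_job_artifact(registry, primary_url, local_path)
+  registry.update!(state: :started)
+
+  # Download the blob from the primary site's internal retrieve endpoint.
+  uri = URI("#{primary_url}/geo/retrieve/job_artifact/#{registry.artifact_id}")
+  File.binwrite(local_path, Net::HTTP.get(uri))
+
+  # Synced; verification happens in a later pass.
+  registry.update!(state: :synced, verification_state: :pending)
+rescue StandardError => e
+  registry.update!(state: :failed, last_sync_failure: e.message)
+end
+```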
+
+##### Backfilling existing job artifacts
+
+- Sysadmin has an existing GitLab site without Geo
+- There are existing CI jobs and job artifacts
+- Sysadmin sets up a new GitLab site and configures it to be a secondary Geo site
+
+Secondary site:
+
+There are two cron jobs running every minute: `Geo::Secondary::RegistryConsistencyWorker` and `Geo::RegistrySyncWorker`. The workflow below is split in two accordingly.
+
+```mermaid
+sequenceDiagram
+ participant SC as Sidekiq-cron
+ participant R as Redis
+ participant S as Sidekiq
+ participant DB as PostgreSQL
+ participant TDB as PostgreSQL Tracking DB
+ SC->>R: Enqueue `Geo::Secondary::RegistryConsistencyWorker`
+ S->>R: Pick up `Geo::Secondary::RegistryConsistencyWorker`
+ S->>DB: Query `ci_job_artifacts`
+ S->>TDB: Query `job_artifact_registry`
+ S->>TDB: Insert to `job_artifact_registry`
+```
-CI Job Artifacts and LFS objects are synced in a similar way as uploads,
-but they are tracked by `Geo::JobArtifactRegistry`, and `Geo::LfsObjectRegistry`
-models respectively.
+- [Sidekiq-cron](https://github.com/ondrejbartas/sidekiq-cron) enqueues a `Geo::Secondary::RegistryConsistencyWorker` job every minute. As long as it is actively doing work (creating and deleting rows), this job immediately reenqueues itself. This job uses an exclusive lease to prevent multiple instances of itself from running simultaneously.
+- [Sidekiq](architecture.md#sidekiq) picks up `Geo::Secondary::RegistryConsistencyWorker` job
+ - Sidekiq queries the `ci_job_artifacts` table for up to 10000 rows
+ - Sidekiq queries the `job_artifact_registry` table for up to 10000 rows
+ - Sidekiq inserts a `job_artifact_registry` row in the [PostgreSQL Geo Tracking Database](#tracking-database) for each existing job artifact that is missing one
+
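+The backfill pass amounts to a batched set difference between the model table
+and the registry table, roughly like this sketch (batching is simplified):
+
+```ruby
+# Rough sketch of one RegistryConsistencyWorker pass: create registry rows
+# for job artifacts that have none yet.
+artifact_ids = Ci::JobArtifact.order(:id).limit(10_000).pluck(:id)
+tracked_ids  = Geo::JobArtifactRegistry.where(artifact_id: artifact_ids).pluck(:artifact_id)
+
+(artifact_ids - tracked_ids).each do |id|
+  Geo::JobArtifactRegistry.create!(artifact_id: id) # row starts as "pending sync"
+end
+```
+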
+```mermaid
+sequenceDiagram
+ participant SC as Sidekiq-cron
+ participant R as Redis
+ participant S as Sidekiq
+ participant DB as PostgreSQL
+ participant TDB as PostgreSQL Tracking DB
+ participant PP as Primary site Puma
+ SC->>R: Enqueue `Geo::RegistrySyncWorker`
+ S->>R: Pick up `Geo::RegistrySyncWorker`
+ S->>TDB: Query `*_registry` tables
+ S->>R: Enqueue `Geo::EventWorker`s
+ S->>R: Pick up `Geo::EventWorker`
+ S->>TDB: Insert to `job_artifact_registry`, "starting sync"
+ S->>PP: GET <primary site internal URL>/geo/retrieve/job_artifact/123
+ S->>TDB: Update `job_artifact_registry`, "synced"
+```
+
+- [Sidekiq-cron](https://github.com/ondrejbartas/sidekiq-cron) enqueues a `Geo::RegistrySyncWorker` job every minute. As long as it is actively doing work, this job loops for up to an hour scheduling sync jobs. This job uses an exclusive lease to prevent multiple instances of itself from running simultaneously.
+- [Sidekiq](architecture.md#sidekiq) picks up `Geo::RegistrySyncWorker` job
+ - Sidekiq queries all `*_registry` tables in the [PostgreSQL Geo Tracking Database](#tracking-database) for "never attempted sync" rows. It interleaves rows from each table and adds them to an in-memory queue.
+ - If the previous step yielded fewer than 1000 rows, then Sidekiq queries all `*_registry` tables for "failed sync and ready to retry" rows, interleaves them, and adds them to the in-memory queue.
+ - Sidekiq enqueues `Geo::EventWorker` jobs with arguments like "Job Artifact with ID 123 was updated" for each item in the queue, and tracks the enqueued Sidekiq job IDs.
+ - Sidekiq stops enqueuing `Geo::EventWorker` jobs when the "maximum concurrency limit" setting is reached
+ - Sidekiq loops doing this kind of work until it has no more to do
+- Sidekiq picks up `Geo::EventWorker` job
+ - Sidekiq marks the `job_artifact_registry` row as "started sync"
+ - Sidekiq does a GET request on an API endpoint at the primary Geo site and downloads the file
+ - Sidekiq marks the `job_artifact_registry` row as "synced" and "pending verification"
+
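+The scheduling loop can be pictured as below. The scope names
+(`never_attempted_sync`, `failed_and_ready_to_retry`), `registry_classes`,
+`interleave`, `max_concurrency`, and the `Geo::EventWorker` argument shape are
+assumptions based on the description above, not the real API:
+
+```ruby
+# Hypothetical sketch of the Geo::RegistrySyncWorker scheduling loop.
+queue = interleave(registry_classes.map { |klass| klass.never_attempted_sync.limit(1000).to_a })
+
+# Top up with retryable failures only when under the 1000-row budget.
+if queue.size < 1000
+  queue += interleave(registry_classes.map { |klass| klass.failed_and_ready_to_retry.limit(1000).to_a })
+end
+
+enqueued_job_ids = []
+queue.each do |registry|
+  break if enqueued_job_ids.size >= max_concurrency # "maximum concurrency limit"
+
+  enqueued_job_ids << Geo::EventWorker.perform_async('job_artifact', 'updated', registry.artifact_id)
+end
+```
+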
+##### Verifying a new job artifact
+
+Primary site:
+
+```mermaid
+sequenceDiagram
+ participant Ru as Runner
+ participant P as Puma
+ participant DB as PostgreSQL
+ participant SC as Sidekiq-cron
+ participant Rd as Redis
+ participant S as Sidekiq
+ participant F as Filesystem
+ Ru->>P: Upload artifact
+ P->>DB: Insert `ci_job_artifacts`
+ P->>DB: Insert `ci_job_artifact_states`
+ SC->>Rd: Enqueue `Geo::VerificationCronWorker`
+ S->>Rd: Pick up `Geo::VerificationCronWorker`
+ S->>DB: Query `ci_job_artifact_states`
+ S->>Rd: Enqueue `Geo::VerificationBatchWorker`
+ S->>Rd: Pick up `Geo::VerificationBatchWorker`
+ S->>DB: Query `ci_job_artifact_states`
+ S->>DB: Update `ci_job_artifact_states` row, "started"
+ S->>F: Checksum file
+ S->>DB: Update `ci_job_artifact_states` row, "succeeded"
+```
+
+- A [Runner](https://docs.gitlab.com/runner/) uploads an artifact
+- [Puma](architecture.md#puma) creates a `ci_job_artifacts` row
+- Puma creates a `ci_job_artifact_states` row to store verification state.
+ - The row is marked "pending verification"
+- [Sidekiq-cron](https://github.com/ondrejbartas/sidekiq-cron) enqueues a `Geo::VerificationCronWorker` job every minute
+- [Sidekiq](architecture.md#sidekiq) picks up the `Geo::VerificationCronWorker` job
+ - Sidekiq queries `ci_job_artifact_states` for the number of rows marked "pending verification" or "failed verification and ready to retry"
+ - Sidekiq enqueues one or more `Geo::VerificationBatchWorker` jobs, limited by the "maximum verification concurrency" setting
+- Sidekiq picks up `Geo::VerificationBatchWorker` job
+ - Sidekiq queries `ci_job_artifact_states` for rows marked "pending verification"
+ - If the previous step yielded fewer than 10 rows, then Sidekiq queries `ci_job_artifact_states` for rows marked "failed verification and ready to retry"
+ - For each row
+ - Sidekiq marks it "started verification"
+ - Sidekiq gets the SHA256 checksum of the file
+ - Sidekiq saves the checksum in the row and marks it "succeeded verification"
+ - Now secondary Geo sites can compare against this checksum
+
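+The checksum step itself is conceptually simple. In this sketch, `state_row`
+and `artifact_path` are hypothetical stand-ins:
+
+```ruby
+# Conceptual sketch of the primary-side verification step.
+require 'digest'
+
+state_row.update!(verification_state: :started)
+
+checksum = Digest::SHA256.file(artifact_path).hexdigest
+
+# Persist the checksum so secondary sites can compare against it.
+state_row.update!(verification_state: :succeeded, verification_checksum: checksum)
+```
+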
+Secondary site:
+
+```mermaid
+sequenceDiagram
+ participant SC as Sidekiq-cron
+ participant R as Redis
+ participant S as Sidekiq
+ participant TDB as PostgreSQL Tracking DB
+ participant F as Filesystem
+ participant DB as PostgreSQL
+ SC->>R: Enqueue `Geo::VerificationCronWorker`
+ S->>R: Pick up `Geo::VerificationCronWorker`
+ S->>TDB: Query `job_artifact_registry`
+ S->>R: Enqueue `Geo::VerificationBatchWorker`
+ S->>R: Pick up `Geo::VerificationBatchWorker`
+ S->>TDB: Query `job_artifact_registry`
+ S->>TDB: Update `job_artifact_registry` row, "started"
+ S->>F: Checksum file
+ S->>DB: Query `ci_job_artifact_states`
+ S->>TDB: Update `job_artifact_registry` row, "succeeded"
+```
+
+- After the artifact is successfully synced, it becomes "pending verification"
+- [Sidekiq-cron](https://github.com/ondrejbartas/sidekiq-cron) enqueues a `Geo::VerificationCronWorker` job every minute
+- [Sidekiq](architecture.md#sidekiq) picks up the `Geo::VerificationCronWorker` job
+ - Sidekiq queries `job_artifact_registry` in the [PostgreSQL Geo Tracking Database](#tracking-database) for the number of rows marked "pending verification" or "failed verification and ready to retry"
+ - Sidekiq enqueues one or more `Geo::VerificationBatchWorker` jobs, limited by the "maximum verification concurrency" setting
+- Sidekiq picks up `Geo::VerificationBatchWorker` job
+ - Sidekiq queries `job_artifact_registry` in the PostgreSQL Geo Tracking Database for rows marked "pending verification"
+ - If the previous step yielded fewer than 10 rows, then Sidekiq queries `job_artifact_registry` for rows marked "failed verification and ready to retry"
+ - For each row
+ - Sidekiq marks it "started verification"
+ - Sidekiq gets the SHA256 checksum of the file
+ - Sidekiq saves the checksum in the row
+ - Sidekiq compares the checksum against the checksum in the `ci_job_artifact_states` row which was replicated by PostgreSQL
+ - If the checksum matches, then Sidekiq marks the `job_artifact_registry` row "succeeded verification"
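+
+The final comparison on the secondary is roughly the following sketch, where
+`registry` stands in for the `job_artifact_registry` row and `state_row` for
+the replicated `ci_job_artifact_states` row:
+
+```ruby
+# Sketch of the secondary-side checksum comparison.
+require 'digest'
+
+local_checksum = Digest::SHA256.file(artifact_path).hexdigest
+
+if local_checksum == state_row.verification_checksum
+  registry.update!(verification_state: :succeeded, verification_checksum: local_checksum)
+else
+  registry.update!(verification_state: :failed, verification_failure: 'checksum mismatch with primary')
+end
+```
+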
## Authentication
@@ -241,6 +440,22 @@ ignores items in object storage. Either:
## Verification
+### Verification states
+
+The following diagram illustrates how verification works. Some allowed transitions are omitted for clarity.
+
+```mermaid
+stateDiagram-v2
+ Pending --> Started
+ Pending --> Disabled: No primary checksum
+ Disabled --> Started: Primary checksum succeeded
+ Started --> Succeeded
+ Started --> Failed
+ Succeeded --> Pending: Mark for reverify
+ Failed --> Pending: Mark for reverify
+ Failed --> Started: Retry
+```
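+
+The same transitions, written out as an illustrative Ruby table (not the
+actual verification state machine):
+
+```ruby
+# Illustrative transition table for the verification states above.
+VERIFICATION_TRANSITIONS = {
+  pending:   [:started, :disabled], # disabled while there is no primary checksum
+  disabled:  [:started],            # primary checksum succeeded
+  started:   [:succeeded, :failed],
+  succeeded: [:pending],            # mark for reverify
+  failed:    [:pending, :started]   # mark for reverify, or retry
+}.freeze
+```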
+
### Repository verification
Repositories are verified with a checksum.
@@ -252,7 +467,12 @@ basically hashes all Git refs together and stores that hash in the
The **secondary** site does the same to calculate the hash of its
clone, and compares the hash with the value the **primary** site
calculated. If there is a mismatch, Geo will mark this as a mismatch
-and the administrator can see this in the [Geo Admin Area](../user/admin_area/geo_nodes.md).
+and the administrator can see this in the [Geo Admin Area](../user/admin_area/geo_sites.md).
+
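+Conceptually, the repository checksum can be pictured as hashing the ref list,
+as in the sketch below; the real checksum is calculated by Gitaly and differs
+in detail (`repo_path` is a stand-in):
+
+```ruby
+# Conceptual sketch: hash all Git refs together into one checksum.
+require 'digest'
+require 'open3'
+
+refs, status = Open3.capture2('git', '-C', repo_path, 'show-ref')
+raise 'git show-ref failed' unless status.success?
+
+repository_checksum = Digest::SHA256.hexdigest(refs)
+```
+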
+## Geo proxying
+
+Geo secondaries can proxy web requests to the primary.
+Read more on the [Geo proxying (development) page](geo/proxying.md).
## Glossary
@@ -303,10 +523,7 @@ events include:
- Job Artifact Deleted event
- Upload Deleted event
-### Geo Log Cursor
-
-The process running on the **secondary** site that looks for new
-`Geo::EventLog` rows.
+See [Geo Log Cursor daemon](#geo-log-cursor-daemon).
## Code features
@@ -415,7 +632,7 @@ We switch and filter from each event by the `event_name` field.
### Geo Log Cursor (GitLab 10.0 and up)
In GitLab 10.0 and later, [System hooks](#system-hooks-gitlab-87-to-95) are no longer
-used and Geo Log Cursor is used instead. The Log Cursor traverses the
+used; the [Geo Log Cursor](#geo-log-cursor-daemon) is used instead. The Log Cursor traverses the
`Geo::EventLog` rows to see if there are changes since the last time
the log was checked and handles repository updates, deletes,
changes, and renames.