Allan Talver
11/10/2022, 2:36 PMJoseph Heun
11/10/2022, 4:44 PMJakub Sterba
11/10/2022, 9:23 PMSELECT COUNT(Client ID,Client ID)
- returns number of unique primary keys in Clients dataset
• SELECT COUNT(Client ID,Invoice ID)
- returns number of unique foreign keys in Invoices dataset with Invoice ID set as connection point/primary key
Difference of these two metrics is number of client_ids which do not exist in Clients dataset, but it is probably not feasible to list them.
I suggest to add records to Clients dataset even for clients which do not exist in CRM yet in data warehouse so that referential integrity is not broken and update such records with information from CRM as soon as it is available. You can also add some attribute like “CRM Status” to be able to filter records which exist in CRM or do not exist in CRM easily by end users using GoodData.
Other option is to load into Invoices dataset the Client ID which cannot be mapped to any Client ID in Client Dataset as some additional attribute (e.g. Unmapped Client ID). Such field would have to be created in the data warehouse via ETL/ELT.Jakub Sterba
11/10/2022, 9:36 PM