counterweight/data-dwh-dbt-project

Author	SHA1	Message	Date
Oriol Roqué Paniagua	d475285461	Merged PR 3221: Adapted lifecycle logic for deals to include offboardings # Description Adapts deals lifecycle logic by including offboardings from hubspot. It mostly increases the number of churning and inactive states in decrement of active state. I also updated documentation and added an accepted values test. When deploying and refreshing prod, figures in main kpis will be impacted # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [NA] I have checked for DRY opportunities with other models and docs. - [NA] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22689	2024-10-17 09:41:37 +00:00
Oriol Roqué Paniagua	3faf03c139	Merged PR 3207: Cast as dates and timestamps for hubspot deals in staging # Description Changes data type of dates and timestamps of relevant fields in stg_hubspot__deals # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [NA] I have checked for DRY opportunities with other models and docs. - [NA] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22689	2024-10-16 14:57:36 +00:00
Pablo Martin	7f2ca34301	deprecate model and remove docs	2024-10-16 16:21:35 +02:00
Pablo Martin	2877887fae	update ref in exposure	2024-10-16 16:17:47 +02:00
Pablo Martin	9ff2f2cdc5	Merge branch 'master' of ssh.dev.azure.com:v3/guardhog/Data/data-dwh-dbt-project	2024-10-16 14:20:27 +02:00
Pablo Martin	bd7a54050f	split edeposit source	2024-10-16 14:20:24 +02:00
Oriol Roqué Paniagua	b0c50d4da2	Merged PR 3194: First version of Hubspot deals in intermediate # Description Very minimal, first version of Hubspot deals. I intentionally didn't include here sales related information or additional attributes since I don't need them for Churn related topics. This can be done in future PRs. This Deal version includes the name of the Hubspot pipeline and the Stage. It also excludes deals assigned to Guardhog pipeline (~3k) based on my discussion with Alex. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [X] I have checked for DRY opportunities with other models and docs. - [X] I've picked the right materialization for the affected models. Left as a view since there's not that many records, can be changed in the future if needed # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22689	2024-10-16 12:15:14 +00:00
Oriol Roqué Paniagua	004616bb79	Merged PR 3187: Move deal lifecycle related models to cross # Description Moves from intermediate/core to intermediate/cross the following models: - `int_core__mtd_deal_lifecycle` - `int_core__mtd_deal_metrics` to their equivalents: - `int_mtd_deal_lifecycle` - `int_mtd_deal_metrics` This also changes the schema entries, from core to cross, including changing the name of the model in the entry. This also changes the dependencies, namely in `int_mtd_deal_metrics`, `int_mtd_vs_previous_year_metrics` and `int_monthly_aggregated_metrics_history_by_deal`. This does NOT aim to alter the logic of the lifecycle in any case; it will be done in a separated PR. Runs correctly end-to-end. We might need to drop the old models from production manually. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [X] I have checked for DRY opportunities with other models and docs. - [X] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22689	2024-10-16 11:56:49 +00:00
Pablo Martin	c0e411ee12	remove duplicate deals entry	2024-10-15 18:03:31 +02:00
Oriol Roqué Paniagua	35b6472b48	Merged PR 3183: Bugfixes on top losers # Description - Bugfix on nullif then 0 - it was applied to the numerator of Revenue computation, which made MoM growth be considered as null and propagated as 0 in the scores, which is not true. - Bugfix on cast as numeric - this was introduced because PBI didn't read well some decimal figures when loading the data. However this impacted somehow in the score by some weird magic I don't understand. I just replace the casts by rounds, that are applied after the computation of the scores, and PBI seems happy with it. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [NA] I have checked for DRY opportunities with other models and docs. - [NA] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22635	2024-10-15 15:51:04 +00:00
Pablo Martín	da8c2e9fab	Merged PR 3177: Deals with properties # Description Expands the Deals staging model by adding tons of properties. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [X] I have checked for DRY opportunities with other models and docs. - [X] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22225	2024-10-15 14:05:06 +00:00
Pablo Martin	d9a90d7d24	partial schema. Not doing more because deals is a monster	2024-10-15 15:41:20 +02:00
Pablo Martin	e5b18f2ce7	rename col	2024-10-15 14:37:10 +02:00
Pablo Martin	7e9248aa6e	added properties	2024-10-15 14:37:10 +02:00
Oriol Roqué Paniagua	61339a7d58	Merged PR 3171: Improvements on monthly growth score by deal # Description Main changes: * Includes 4 new fields to take into account 12 month created bookings. Specifically: `deal_created_bookings_12_months_window` `global_created_bookings_12_months_window` `deal_contribution_share_to_global_created_bookings` `deal_contribution_rank_to_global_created_bookings` This also renames a CTE, that was previously stating it was revenue. Same for inline comments. Also includes documentation of this fields. * Score range modification: Now, growth scores are multiplied by 100 and weighted score by 1000. This makes it easier to display and understand (Growth cannot be less than -100, threshold value is now -1, 0 and 1). I checked that the content already in production has not change (ex: we still have the same 15 top losers for September). # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [NA] I have checked for DRY opportunities with other models and docs. - [NA] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22635	2024-10-15 12:31:39 +00:00
Oriol Roqué Paniagua	9440e6d624	Merged PR 3169: Adding Churn metrics to Global KPIs # Description Main changes: - Creation of `int_monthly_churn_metrics` model. This follows a similar approach as for mtd models, with jinja loops to aggregate the metrics at different dimensions. This reads from the previous monthly model, thus creating a dependency on Global KPIs with By Deal KPIs. - Adds the 6 metrics in the main aggregated model of Global KPIs `int_mtd_vs_previous_year_metrics`. It doesn't necessarily mean that the 6 metrics will be made available in the report. This does NOT display these metrics in the report, and won't be done until deal lifecycle is enriched to consider hubspot offboarding in the state "05-Churning". # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [X] I have checked for DRY opportunities with other models and docs. - [X] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22691	2024-10-15 10:46:56 +00:00
Oriol Roqué Paniagua	901be930df	Merged PR 3163: First version of 12m window contribution by deal # Description This PR creates a new model that depends on int_monthly_aggregated_metrics_history_by_deal. The idea is that this is used for Churn computation (Booking Churn, Revenue Churn, Listing Churn) later on. The idea is relatively simple. Measure how much a Deal has been contributing to a Global amount (sum of metric for all deals) over the preceding period of 12 months. You will notice that there's 2 computations, the "additive" and the "average" one. This is because we still need to align with Matt/Suzannah on which approach makes more sense, but we need data for it. I'm not sure the namings are good though so happy to see your suggestions. You will also notice that there's no filter by deal_lifecycle_state = '06-Churning'. This will be done in a separated model, whenever we attribute this model to the mtd computation. The reason is simple - this model stays at deal level, thus meaning we can do the dimension aggregation and even a lifecycle aggregation if needed, depending on the needs. Be aware that this effectively means that MTD KPIs models will depend on the "monthly by deal" models. This has some cons in terms of dependency management but cannot be overcome since we the metric total revenue depends on many subsets. In essence, I don't see another way of doing it unless doing a massive KPIs refactor. I prefer to wait until the Product KPIs discussions are finished and then we see how we approach it. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [X] I have checked for DRY opportunities with other models and docs. - [X] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22691	2024-10-15 06:51:41 +00:00
uri	ca24836395	creating exposures entry for top_losers report	2024-10-14 16:46:31 +02:00
Oriol Roqué Paniagua	eb213acb9e	Merged PR 3137: Growth score to reporting # Description Copies intermediate to reporting for growth score by deal. Schema is copy-paste from intermediate changing the model's name. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [NA] I have checked for DRY opportunities with other models and docs. - [X] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22635	2024-10-14 12:26:01 +00:00
Pablo Martin	ee0d60382d	more schemas	2024-10-14 11:55:44 +02:00
Pablo Martin	5c64bf8b20	schema and final touches for deal pipeline	2024-10-14 11:47:44 +02:00
Pablo Martin	b9958129bc	stages model	2024-10-14 11:37:42 +02:00
Pablo Martin	c0718daaa9	deal pipeline	2024-10-14 11:10:07 +02:00
Oriol Roqué Paniagua	7f9c038fc0	Merged PR 3120: Creation of growth score by deal model for top losers report (intermediate) # Description Creates a model to identify deal growth based on YoY performance of Created Bookings, YoY performance of Listings Booked in Month and one month shifted YoY performance of Revenue. Also added weighted score to account for revenue size. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [ ] I have checked for DRY opportunities with other models and docs. Probably something can be done here, sorry I've not checked. - [X] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22635	2024-10-11 07:20:35 +00:00
Oriol Roqué Paniagua	52f01adc11	Merged PR 3127: (3/3) Revenue renaming - KPIs by deal # Description Main changes: * Guest revenue is now guest payments. PBI uses Guest revenue, so alias is changed at reporting level, while it uses guest_payments_in_gbp field. * Removal of Waiver Amount Paid back to Host to Guest revenue and Total revenue. # Checklist - [X] The edited models and dependants run properly with production data. - [NA] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [NA] I have checked for DRY opportunities with other models and docs. - [NA] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22688	2024-10-10 14:01:49 +00:00
Oriol Roqué Paniagua	745f00bad2	Merged PR 3124: 1/3 - Revenue renaming Main KPIs - MTD scope # Description Adapts revenue figures in Main KPIs - MTD scope or global view. This includes MTD, Monthly Overview, Global Evolution over Time, Detail by Category. In essence, everything that is not by deal. The changes are mainly 2: * Remove the line that deducts the `Waiver Amount Paid Back to Hosts` in all metrics except the `Waiver Net Fees`. This effectively means that the previous `Guest Revenue` = `Guest Payments`, thus I dropped all 3 `Guest Payments` metrics. * Do a renaming at metric display level, but not in the code. This means that I remove the computation of `guest_revenue_in_gbp` for instance and keep `guest_payments_in_gbp`, and apply the renaming later on, since the modelisation already accounts for defining metric names differently from those of the fields. For the rest of metrics, I revised all metrics name and did changes based on the [whiteboard](https://whiteboard.office.com/me/whiteboards/p/c3BvOmh0dHBzOi8vZ3VhcmRob2ctbXkuc2hhcmVwb2ludC5jb20vcGVyc29uYWwvcGFibG9fbWFydGluX3N1cGVyaG9nX2NvbQ%3d%3d/b!T2D3opQuBECSDnhuFZrUacFu3TxvSvdIsnI4Dxsh2IuaB1AigbciRqkqte61I4wz/01H5SI4J4L7HTPJGUT7JGYKTOSQYYWACXU). I also changed the dedicated data tests in Main KPIs to ensure it's working. I also changed the exclusion logic in reporting based on the name of the metric to not display metrics that depend on the invoicing cycle unless it's 2 months ago or before. To keep in mind: * Merging this will automatically display the new figures/naming in production. Might be wise to communicate to stakeholders since some key metrics (namely, Guest Revenue / Total Revenue) will change the meaning. * We also need to do these changes in the metrics by deal part of the computation. I'd do first the removal of these fields in the PBI report (and take the opportunity to change the Data Catalogue) and then do the PR in DWH to change the logic. Before that though let's check that the names included in this PR are the correct ones :) # Checklist - [X] The edited models and dependants run properly with production data. - [NA] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [NA] I have checked for DRY opportunities with other models and docs. - [NA] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #22688	2024-10-10 13:46:59 +00:00
Pablo Martin	936b4878cf	point exposure to the table that PBI actually uses	2024-10-09 17:54:32 +02:00
Pablo Martin	a1a68d236b	removed incorrect uniqueness test	2024-10-09 17:41:43 +02:00
uri	7302790d42	Add new API role CancellationApi in user_role accepted values	2024-10-09 10:17:34 +02:00
Pablo Martin	09fe9ded77	signal model deprecation more clearly, unlock a few tests	2024-10-08 17:44:31 +02:00
Pablo Martin	ea4274b37c	missing docs	2024-10-08 17:25:16 +02:00
Pablo Martin	2d42d99543	docs for athena int	2024-10-08 14:54:11 +02:00
Pablo Martin	c81f6c955d	move docs for int_edeposit__guesty_verifications	2024-10-08 14:45:50 +02:00
Pablo Martin	ba29afa3bd	docs for athena staging	2024-10-08 14:41:17 +02:00
Pablo Martin	13819c9f99	update edeposit staging	2024-10-08 14:39:25 +02:00
Pablo Martin	d3de03890c	more todos	2024-10-08 14:37:00 +02:00
Pablo Martin	d21d9b6a01	todos	2024-10-08 14:37:00 +02:00
Pablo Martin	3dbeaf0575	reporting model for athena	2024-10-08 14:37:00 +02:00
Pablo Martin	654255321f	fix typo in version	2024-10-08 14:37:00 +02:00
Pablo Martin	d16647ac72	push upstream the filter	2024-10-08 14:37:00 +02:00
Pablo Martin	67dcc8b237	split staging layer	2024-10-08 14:37:00 +02:00
Pablo Martin	761cf409c6	push filter down one model	2024-10-08 14:37:00 +02:00
Pablo Martin	30c73b1ab9	duplicate athena verifications	2024-10-08 14:37:00 +02:00
Pablo Martin	67b8e1263d	move guesty to athena folder	2024-10-08 14:37:00 +02:00
Oriol Roqué Paniagua	b70f426f8e	Merged PR 3089: Remove custom pricing/protection # Description This PR aims to remove custom pricing/protection fields for New Pricing. After discussing with Clay, custom pricing/protected amounts are not allowed in New Pricing. This user-specific changes might be applied in the form of discounts, but these are not defined/implemented yet. There were some fields created in the New Pricing tables that were considering the custom possibility, and Gus confirm that these will come as Null. So I prefer to delete them now, have clean code, and if in X months dev team wants to remove these, we should be good. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [NA] I have checked for DRY opportunities with other models and docs. - [NA] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #20809	2024-10-08 11:48:25 +00:00
Oriol Roqué Paniagua	ea8972a91b	Merged PR 3090: Propagates Protection Plan to intermediate, for price and cover # Description Propagates Protection Plan to price and cover to intermediate. Ideally I wanted to have a single table that contained prices + covers, but it is not straight forward to assume: * if a modification of the amount covered by a protection plan will create a new protection plan id, or not * if a modification of the price a protection costs will create a new protection plan id, or not So I keep it split for the time being. Besides, use cases might be different, ex: I want to see all prices of services (product + protection plan, but no need for cover). # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [X] I have checked for DRY opportunities with other models and docs. - [X] I've picked the right materialization for the affected models. # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #20809	2024-10-08 11:44:52 +00:00
Oriol Roqué Paniagua	bad27ecbc3	Merged PR 3087: Adding int_core__product_service_to_price # Description Adds int_core__product_service_to_price. This model is the equivalent of staging but just adding currency code, product service name and a few boolean fields to identify if the product service to price is currently active and if it's the default price or not. Currently it's a view since it has 16 records, likely it can be transformed into a table in the future. The idea will be to create a similar table for protection_plan. # Checklist - [X] The edited models and dependants run properly with production data. - [X] The edited models are sufficiently documented. - [X] The edited models contain PK tests, and I've ran and passed them. - [X] I have checked for DRY opportunities with other models and docs. - [NA] I've picked the right materialization for the affected models. -> We will see in the future if needs to be adapted # Other - [ ] Check if a full-refresh is required after this PR is merged. Related work items: #20809	2024-10-08 11:39:34 +00:00
uri	911603f2e8	Documenting protection name fields in user_product_bundle	2024-10-07 17:21:52 +02:00
uri	9424a96707	Adding protection names	2024-10-07 17:18:41 +02:00
Pablo Martin	0c3be659b1	remove test	2024-10-07 14:28:05 +02:00

... 5 6 7 8 9 ...

1019 commits