Commit graph

427 commits

Author SHA1 Message Date
Pablo Martin
854afffc1c add account code 2024-06-25 15:07:39 +02:00
Pablo Martin
8dc31f3489 add accounts to staging 2024-06-25 15:07:39 +02:00
Pablo Martin
7a9c913b8a line items 2024-06-25 15:07:39 +02:00
Pablo Martin
d86b6b8627 add tests 2024-06-25 15:07:39 +02:00
Pablo Martin
33d072015c remove full jsons with ids 2024-06-25 15:07:39 +02:00
Pablo Martin
25e9d040ff add internal exchange rates 2024-06-25 15:07:39 +02:00
Pablo Martin
4b0b15814e add date converted in staging 2024-06-25 15:07:39 +02:00
Pablo Martin
bf54548654 table alias 2024-06-25 15:07:39 +02:00
Pablo Martin
200c324b68 add converted fields 2024-06-25 15:07:39 +02:00
Pablo Martin
08ff92b3e9 make stg naming coherent with other xero models 2024-06-25 15:07:39 +02:00
Pablo Martin
447cb3926c bring over bank transactions to int 2024-06-25 15:07:39 +02:00
Oriol Roqué Paniagua
6c053a0753 Merged PR 2107: Adds host lifecycle metrics into biz kpis
This PR closes the first draft of the first batch of business KPIs. Host logic has changed to be applied at deal id level.
It's mostly an adapted copy-paste from the accommodation counterpart, specifically:
- `int_core__mtd_deal_lifecycle`: computes the historic deal lifecycle. One line for each deal and MTD date. **Important**: _Not all hosts have a deal set. This will need a data quality report for business teams to fix_
- `int_core__mtd_deal_metrics`: computes the aggregation at MTD date level of the metrics per lifecycle state and activity state

Additionally, this PR changes:
- `int_core__mtd_aggregated_metrics`: includes the 3 new deal metrics and changes the source of the 3 already existing deal metrics from `mtd_booking_metrics` to the new `mtd_deal_metrics`
- `int_core__mtd_booking_metrics`: removes all code needed to compute the remaining deal metrics, speeding it up considerably.

After this PR, the mtd models run (locally) at the following speed:
- `int_core__mtd_accommodation_lifecycle`: 47 sec
- `int_core__mtd_deal_lifecycle`: 3 sec
- `int_core__mtd_accommodation_metrics`: 5 sec
- `int_core__mtd_deal_metrics`: < 1 sec
- `int_core__mtd_booking_metrics`: 8 sec (quite a reduction)
- `int_core__mtd_guest_journey_metrics`: 5 sec
- `int_core__mtd_aggregated_metrics` and `core__mtd_aggregated_metrics`: < 1 sec

Related work items: #17312
2024-06-25 12:20:59 +00:00
Pablo Martín
cebd6f6485 Merged PR 2106: Add VerificationSet to staging
Brings the VerificationSet from Core into staging.

Related work items: #17068
2024-06-25 09:13:11 +00:00
Pablo Martin
cfd752843b add tests 2024-06-25 11:07:53 +02:00
Pablo Martin
c23e79b5fe add staging model 2024-06-25 11:04:28 +02:00
Pablo Martin
8fbbb1d10b add source 2024-06-25 11:04:16 +02:00
Oriol Roqué Paniagua
0655ac8997 Merged PR 2105: Adding listing lifecycle metrics into business KPIs
This PR will compute the listing metrics in an aggregated manner to be displayed in the Main KPIs dashboard, specifically:
- New Listings
- First Time Booked Listings
- Churning Listings
- It also adapts the computation for the already existing metrics of Listings Booked in X months

At code level, it contains the following:
- Adds `int_core__mtd_accommodation_metrics`, which aggregates the listing lifecycle at date level (one row per date), where the date is the corresponding date from `int_dates_mtd`
- Changes `int_core__mtd_aggregated_metrics` to take the accommodation metrics from the new model. The 3 already existing metrics (Listings Booked in X months) now read from the new model as well.
- Changes `int_core__mtd_booking_metrics` to remove unused computation, making it lighter. Specifically, it removes 1) listing-related metrics, since we now have a dedicated model, and 2) number of guests booked, since it's not used at all.

The resulting values in my local environment are consistent with what is already reported in the staging report.

Related work items: #17312
2024-06-25 08:14:23 +00:00
Oriol Roqué Paniagua
f23e210129 Merged PR 2094: Removing lifecycle logic from int_core__accommodation
Removing lifecycle logic from int_core__accommodation.
This logic is now available in int_core__mtd_accommodation_lifecycle.

Related work items: #17312
2024-06-21 14:13:42 +00:00
Oriol Roqué Paniagua
ef80637a9b Merged PR 2090: Adding int_core__mtd_accommodation_lifecycle
Adding int_core__mtd_accommodation_lifecycle. Mainly, it recreates the history of the lifecycle of a listing for each date appearing in the MTD dates (so, last day of month + days for current month + days for current month of the previous year).

Using a lag function makes it much faster than a self-join. It runs in approximately 17 seconds locally.
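
To illustrate why a lag tends to beat a self-join, here is a minimal Python sketch, not the model's actual SQL. The rows, column names and function names are hypothetical. The lag version sorts once and walks each listing in order, while the self-join version rescans all rows for every row.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical rows: (listing_id, mtd_date, state). Not taken from the real model.
rows = [
    ("a", "2024-04-30", "new"),
    ("a", "2024-05-31", "active"),
    ("b", "2024-05-31", "new"),
    ("a", "2024-06-25", "churning"),
]


def with_previous_state_lag(rows):
    """One ordered pass per listing, similar in spirit to
    LAG(state) OVER (PARTITION BY listing_id ORDER BY mtd_date)."""
    out = []
    ordered = sorted(rows, key=itemgetter(0, 1))
    for _, group in groupby(ordered, key=itemgetter(0)):
        previous = None
        for listing_id, mtd_date, state in group:
            out.append((listing_id, mtd_date, state, previous))
            previous = state
    return out


def with_previous_state_self_join(rows):
    """Same result via a self-join: every row scans all rows to find
    the latest earlier row of the same listing (quadratic work)."""
    out = []
    for listing_id, mtd_date, state in rows:
        earlier = [r for r in rows if r[0] == listing_id and r[1] < mtd_date]
        previous = max(earlier, key=itemgetter(1))[2] if earlier else None
        out.append((listing_id, mtd_date, state, previous))
    return sorted(out, key=itemgetter(0, 1))


lag_result = with_previous_state_lag(rows)
join_result = with_previous_state_self_join(rows)
```

Both functions return the same rows. The difference is only in how much work is done to find each previous state.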

The logic behind the lifecycle is the same, and the most up-to-date results in my local environment show the same values for the new model and the int_core__accommodation model (see screenshots).

previous model:
![image.png](https://guardhog.visualstudio.com/4148d95f-4b6d-4205-bcff-e9c8e0d2ca65/_apis/git/repositories/54ac356f-aad7-46d2-b62c-e8c5b3bb8ebf/pullRequests/2090/attachments/image.png)

new model:
![image (2).png](https://guardhog.visualstudio.com/4148d95f-4b6d-4205-bcff-e9c8e0d2ca65/_apis/git/repositories/54ac356f-aad7-46d2-b62c-e8c5b3bb8ebf/pullRequests/2090/attachments/image%20%282%29.png)

Follow-up PRs will focus on two things: adapting the logic of int_core__accommodation so it no longer replicates the lifecycle computation (it can simply reuse the last available date in int_core__mtd_accommodation_lifecycle), and creating the desired metrics for the Biz Overview dashboard, including a refactor of mtd_bookings to remove the listing logic from it.

Related work items: #17312
2024-06-21 13:59:14 +00:00
Oriol Roqué Paniagua
fe93f594f5 Merged PR 2084: Adding int_core__accommodation
Adding int_core__accommodation

Includes both:
- Main information of the accommodation, mostly coming from stg_core__accommodation and int_core__country.
- Listing lifecycle computation, based on the created bookings from stg_core__bookings. It's just the current state, no history.

Some considerations:
- I opted to use stg_core__bookings and not int_core__bookings. The main reason is that, if at some point we want to add listing-based information to the booking table, this avoids cyclic references.
- I opted to keep all the logic of 1) accommodation info and 2) lifecycle in the same model. This could easily be split in two: a lifecycle model that reads only from staging, and then int_core__accommodation, which would read the accommodation attributes from staging plus the lifecycle model. Up to you.

I'd suggest reviewing the documentation in the schema first, since it explains the logic applied.

Notion page linked to this task: https://www.notion.so/knowyourguest-superhog/Listing-lifecycle-4dc0311b21ca44f8859969e419872ebd

Related work items: #17312
2024-06-20 16:02:16 +00:00
Pablo Martín
80120e68a2 Merged PR 2087: Add Xero Bank Transactions to staging
- Main purpose is to bring the Bank Transactions table from sync to stg.

- Along the way:
  - I created a couple of new generic tests.
  - I reviewed docs and tests for a couple of other Xero models that were not in the best shape.

Related work items: #17551
2024-06-20 14:42:03 +00:00
Pablo Martin
696490aecb docs for bank transactions table 2024-06-20 15:52:55 +02:00
Pablo Martin
26e7358648 credit notes docs 2024-06-20 15:26:33 +02:00
Pablo Martin
e1a4faf3b9 more contact tests 2024-06-20 15:15:19 +02:00
Pablo Martin
8f14f5d37d add length between test 2024-06-20 15:13:21 +02:00
Pablo Martin
7aa667f7cf boy scout: tests for invoices 2024-06-20 15:00:59 +02:00
Pablo Martin
34b678df26 new not_negative test 2024-06-20 14:43:51 +02:00
Pablo Martin
c671701071 boy scout: contact tests 2024-06-20 14:37:55 +02:00
Pablo Martin
79d69a9286 staging model 2024-06-20 14:31:12 +02:00
Pablo Martin
cd9c797f02 add source 2024-06-20 14:12:30 +02:00
Pablo Martin
05f35fcb69 add value to test 2024-06-19 18:25:49 +02:00
Pablo Martín
5bea450428 Merged PR 2083: Add values to Stripe tests
A couple of `accepted_values` tests for Stripe data are failing in production, but only because our tests are too strict and don't account for all the values that can appear.

This PR fixes that and also adds a small comment on the funky sources table, which won't hurt future modelers.
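
As a rough illustration of what an accepted-values check does and why a too-short list fails, here is a minimal Python sketch. The function name and the status values are hypothetical and are not the actual Stripe values or the project's test definition.

```python
def unaccepted_values(observed, accepted):
    """Return the distinct observed values that are missing from the accepted list."""
    return sorted(set(observed) - set(accepted))


# Hypothetical values seen in the data (illustrative only).
observed_statuses = ["paid", "paid", "refunded", "disputed"]

# A list that is too strict flags legitimate values as failures.
too_strict = ["paid", "refunded"]
failures_before = unaccepted_values(observed_statuses, too_strict)

# Adding the missing legitimate value makes the check pass.
widened = ["paid", "refunded", "disputed"]
failures_after = unaccepted_values(observed_statuses, widened)
```

The check fails whenever the returned list is non-empty, so widening the accepted list is the fix when the flagged values are legitimate.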
2024-06-19 16:23:49 +00:00
Pablo Martin
6ad203b92f add some comments 2024-06-19 18:18:38 +02:00
Oriol Roqué Paniagua
839e5fae1b Merged PR 2077: Adding Country to intermediate
Adding Country to intermediate, both model + documentation.

At this stage, the model is set as a view, but we can discuss what the best approach is.

Related work items: #17312
2024-06-19 15:34:15 +00:00
Joaquin Ossa
4a80625eeb Merged PR 2064: Created currency exchange tables
Created currency exchange tables in general for new dashboard

Related work items: #17323
2024-06-19 10:01:20 +00:00
Joaquin Ossa
31feff162a Fixed names of tables 2024-06-19 11:59:45 +02:00
Joaquin Ossa
f64b17a223 Created currency exchange tables 2024-06-19 11:52:28 +02:00
Oriol Roqué Paniagua
fb86850ffe Merged PR 2074: Adding country to staging
Adding country to staging

Related work items: #17312
2024-06-19 08:40:08 +00:00
Oriol Roqué Paniagua
771b226888 Merged PR 2068: Adding cancelled bookings metric
Adding cancelled bookings metric based on the feedback of the tech team. Mainly, for bookings with status cancelled, `updated_date_utc` can be taken as the cancellation date, since cancelled is a terminal state and no additional steps should follow.
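
A minimal Python sketch of this rule, assuming hypothetical booking records. Only `updated_date_utc` comes from the description above; the other field names, the status spellings and the function name are assumptions.

```python
from collections import Counter
from datetime import date

# Hypothetical bookings; ids, statuses and dates are made up for illustration.
bookings = [
    {"id": 1, "status": "cancelled", "updated_date_utc": date(2024, 6, 3)},
    {"id": 2, "status": "confirmed", "updated_date_utc": date(2024, 6, 3)},
    {"id": 3, "status": "cancelled", "updated_date_utc": date(2024, 6, 3)},
    {"id": 4, "status": "cancelled", "updated_date_utc": date(2024, 6, 10)},
]


def cancellation_date(booking):
    """Treat the last update of a cancelled booking as its cancellation date.

    This relies on cancelled being a terminal state: nothing should update
    the booking afterwards. Other statuses have no cancellation date.
    """
    if booking["status"] == "cancelled":
        return booking["updated_date_utc"]
    return None


# Count cancelled bookings per cancellation date.
cancelled_per_day = Counter(
    d for d in map(cancellation_date, bookings) if d is not None
)
```

If a cancelled booking were ever updated again, this approach would shift its cancellation date, which is why the terminal-state assumption matters.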

I also took the opportunity to update:
- The order in `int_core__mtd_aggregated_metrics`, so it matches the one in the Notion page for the 1st batch, already leaving room in the order_by numbers for the missing metrics
- The alias acronyms in the `main_kpi` subquery in `int_core__mtd_booking_metrics`, making them slightly clearer
- The end of the file in `int_core__mtd_booking_metrics`, removing the empty line

Keep in mind that the cancelled bookings metric will be directly available in the dashboard once this PR is approved and DWH re-runs.

Related work items: #17310
2024-06-18 14:58:55 +00:00
Pablo Martín
06fc35e58d Merged PR 2063: Part 2 of the previous PR
Adds tests and also applies `upper` to a field in staging for funky reasons.
2024-06-18 11:41:55 +00:00
Pablo Martin
344419e660 formatting 2024-06-18 13:41:32 +02:00
Pablo Martin
3d3f3fb9ab fix docs 2024-06-18 13:40:36 +02:00
Pablo Martin
27759374b8 add docs and tests 2024-06-18 13:10:47 +02:00
Pablo Martin
c051831d6d upper in payments 2024-06-18 13:09:45 +02:00
Pablo Martín
361b1b6a7f Merged PR 2061: Deprecate hardcoded rates
This PR replaces the old hardcoded rates with the new ones from xe.com in the only model that used them.

Related work items: #17493
2024-06-18 10:34:32 +00:00
Pablo Martin
662c7b8ba8 remove hardcoded rates and seed, remove docs 2024-06-18 11:35:07 +02:00
Pablo Martin
8b91babad6 remove old table from cte 2024-06-18 11:31:59 +02:00
Pablo Martin
1f9df9ea5c remove old table 2024-06-18 11:24:03 +02:00
Pablo Martin
d96c8b2abb change which field we join on: payment date is not always informed 2024-06-18 11:23:10 +02:00
Pablo Martin
6f625ec7db do both rates to compare and test 2024-06-18 11:13:51 +02:00