Merged PR 2084: Adding int_core__accommodation
Adding int_core__accommodation Includes both: - Main information of the accommodation, mostly coming from stg_core__accommodation and int_core__country. - Listing lifecycle computation, based on the created bookings from stg_core__bookings. It's just the current state, no history. Some considerations: - I opted to use stg_core__bookings and not int_core__bookings. Main reason is in case at some point we want to add listing-based information to the booking table, it would avoid cyclic references. - I opted to keep all the logic of 1) accommodation info and 2) lifecycle in the same model. This could be easily split into: lifecycle first that reads uniquely from staging and then the int_core__accommodation that could read from the staging version to retrieve accommodation attributes + the lifecycle one. Up to you I'd suggest to review first the documentation in schema since it explains the logic applied. Notion page linked to this task: https://www.notion.so/knowyourguest-superhog/Listing-lifecycle-4dc0311b21ca44f8859969e419872ebd Related work items: #17312
This commit is contained in:
parent
80120e68a2
commit
fe93f594f5
2 changed files with 359 additions and 1 deletions
|
|
@ -495,7 +495,197 @@ models:
|
|||
These codes are part of the ISO 4217 standard.
|
||||
tests:
|
||||
- not_null
|
||||
|
||||
|
||||
- name: int_core__accommodation
|
||||
description: |
|
||||
This model contains information regarding accommodations, also known as listings.
|
||||
It contains information regarding the host this accommodation is linked to,
|
||||
the geographic details, the preferred currency according to the country, details about
|
||||
the listing itself (floors, bedrooms, etc) and time-related information of when the
|
||||
listing was created, booked for the first time, last time, and second-to-last time.
|
||||
|
||||
The information regarding the booking-related time allows for the current status of
|
||||
any listing regarding its activity. There's no history, it's just the most up-to-date
|
||||
status of the listing activity. This information is encapsulated in the following columns:
|
||||
|
||||
accommodation_lifecycle_state: contains one of the following states
|
||||
- 01-New: Listings that have been created in the current month, without bookings
|
||||
- 02-Never Booked: Listings that have been created before the current month, without bookings.
|
||||
- 03-First Time Booked: Listings that have been booked for the first time in the current month.
|
||||
- 04-Active: Listings that have booking activity in the past 12 months (that are not FTB nor reactivated)
|
||||
- 05-Churning: Listings that are becoming inactive because of lack of bookings in the past 12 months
|
||||
- 06-Inactive: Listings that have not had a booking for more than 12 months.
|
||||
- 07-Reactivated: Listings that have had a booking in the current month that were inactive or churning before.
|
||||
- Finally, if none of the logic applies, which should not happen, null will be set and a dbt alert will raise.
|
||||
|
||||
Since the states of Active, First Time Booked and Reactivated indicate certain booking activity and are
|
||||
mutually exclusive, the model also provides information of the recency of the bookings by the following
|
||||
booleans:
|
||||
- has_been_booked_in_1_month: If a listing has had a booking created in the current month
|
||||
- has_been_booked_in_6_months: If a listing has had a booking created in the past 6 months
|
||||
- has_been_booked_within_last_12_months: If a listing has had a booking created in the past 12 months
|
||||
Note that if a listing has had a booking created this month, all 3 columns will be true. Similarly,
|
||||
if the last booking created to a listing was 5 months ago, only the column has_been_booked_in_1_month
|
||||
will be false; while the other 2 will be true.
|
||||
|
||||
|
||||
columns:
|
||||
- name: id_accommodation
|
||||
data_type: bigint
|
||||
description: Id of the accommodation or listing. It's the unique key for this model.
|
||||
tests:
|
||||
- not_null
|
||||
- unique
|
||||
|
||||
- name: id_user_host
|
||||
data_type: character varying
|
||||
description: The unique ID for the host. Can be null.
|
||||
|
||||
- name: id_payment_validation_set
|
||||
data_type: bigint
|
||||
description: Id of the payment validation set linked to a listing. Can be null.
|
||||
|
||||
- name: friendly_name
|
||||
data_type: character varying
|
||||
|
||||
- name: country_iso_2
|
||||
data_type: char(2)
|
||||
description: ISO 3166-1 alpha-2 country code where the listing is located.
|
||||
|
||||
- name: country_name
|
||||
data_type: character varying
|
||||
description: Name of the country where the listing is located.
|
||||
|
||||
- name: country_preferred_currency_code
|
||||
data_type: char(3)
|
||||
description: |
|
||||
Three-letter code assigned to the preferred currency for a given country by the ISO.
|
||||
These codes are part of the ISO 4217 standard. Keep in mind this are preferred, not
|
||||
necessarily the actual currency.
|
||||
|
||||
- name: is_active
|
||||
data_type: boolean
|
||||
description: |
|
||||
Boolean to indicate if the listing is active or not. If false, this is considered as a
|
||||
hard deactivation - meaning no more bookings can be assigned to this listing. However,
|
||||
even if a listing is active, that does not necessarily mean that it's receiving bookings.
|
||||
Do not confuse this column with the lifecycle activity of a listing.
|
||||
|
||||
- name: town
|
||||
data_type: character varying
|
||||
|
||||
- name: postcode
|
||||
data_type: character varying
|
||||
|
||||
- name: address_line_1
|
||||
data_type: character varying
|
||||
|
||||
- name: address_line_2
|
||||
data_type: character varying
|
||||
|
||||
- name: verification_level
|
||||
data_type: integer
|
||||
|
||||
- name: floor_area
|
||||
data_type: integer
|
||||
|
||||
- name: number_of_floors
|
||||
data_type: integer
|
||||
|
||||
- name: number_of_bedrooms
|
||||
data_type: integer
|
||||
|
||||
- name: number_of_bathrooms
|
||||
data_type: integer
|
||||
|
||||
- name: number_of_other_rooms
|
||||
data_type: integer
|
||||
|
||||
- name: construction_details
|
||||
data_type: character varying
|
||||
|
||||
- name: accommodation_lifecycle_state
|
||||
data_type: character varying
|
||||
description: |
|
||||
Contains the lifecycle state of a Listing. The accepted values are:
|
||||
01-New, 02-Never Booked, 03-First Time Booked, 04-Active, 05-Churning, 06-Inactive,
|
||||
07-Reactivated. Failing to implement the logic will result in alert.
|
||||
tests:
|
||||
- not_null
|
||||
|
||||
- name: has_been_booked_within_current_month
|
||||
data_type: boolean
|
||||
description: If the listing has had a booking created in the current month.
|
||||
|
||||
- name: has_been_booked_within_last_6_months
|
||||
data_type: boolean
|
||||
description: If the listing has had a booking created in the past 6 months.
|
||||
|
||||
- name: has_been_booked_within_last_12_months
|
||||
data_type: boolean
|
||||
description: If the listing has had a booking created in the past 12 months.
|
||||
|
||||
- name: created_at_utc
|
||||
data_type: timestamp
|
||||
description: Timestamp of when the listing was created. Cannot be null.
|
||||
tests:
|
||||
- not_null
|
||||
|
||||
- name: created_date_utc
|
||||
data_type: date
|
||||
description: Date of when the listing was created
|
||||
|
||||
- name: updated_at_utc
|
||||
data_type: timestamp
|
||||
description: Timestamp of when the listing was last updated according to the backend.
|
||||
|
||||
- name: updated_date_utc
|
||||
data_type: date
|
||||
description: Date of when the listing was last updated according to the backend.
|
||||
|
||||
- name: first_time_booked_at_utc
|
||||
data_type: timestamp
|
||||
description: |
|
||||
Timestamp of the first booking created for a given listing. Can be null if the listing
|
||||
has never had a booking associated with it.
|
||||
|
||||
- name: first_time_booked_date_utc
|
||||
data_type: date
|
||||
description: |
|
||||
Date of the first booking created for a given listing. Can be null if the listing
|
||||
has never had a booking associated with it.
|
||||
|
||||
- name: last_time_booked_at_utc
|
||||
data_type: timestamp
|
||||
description: |
|
||||
Timestamp of the last booking created for a given listing. Can be null if the listing
|
||||
has never had a booking associated with it. Can be the same as first_time_booked_at_utc
|
||||
if the listing only had 1 booking in its history.
|
||||
|
||||
- name: last_time_booked_date_utc
|
||||
data_type: date
|
||||
description: |
|
||||
Date of the last booking created for a given listing. Can be null if the listing
|
||||
has never had a booking associated with it. Can be the same as first_time_booked_date_utc
|
||||
if the listing only had 1 booking in its history.
|
||||
|
||||
- name: second_to_last_time_booked_at_utc
|
||||
data_type: timestamp
|
||||
description: |
|
||||
Timestamp of the second-to-last booking created for a given listing, meaning the creation
|
||||
time of the booking that precedes the last one. It's relevant for the reactivation computation
|
||||
on the lifecycle. Can be null if the listing has never had a booking associated with it or if
|
||||
the listing only had 1 booking in its history.
|
||||
|
||||
- name: second_to_last_time_booked_date_utc
|
||||
data_type: date
|
||||
description: |
|
||||
Date of the second-to-last booking created for a given listing, meaning the creation
|
||||
date of the booking that precedes the last one. It's relevant for the reactivation computation
|
||||
on the lifecycle. Can be null if the listing has never had a booking associated with it or if
|
||||
the listing only had 1 booking in its history.
|
||||
|
||||
- name: dwh_extracted_at_utc
|
||||
data_type: timestamp
|
||||
description: Timestamp of when the accommodation record was extracted from the backend into the DWH.
|
||||
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue