3. Data and methodology

This study uses CoreLogic’s property settled sales transactions data, received on a monthly basis between October 2019 and April 2023.

Relying on real-time analysis terminology, each data release is referred to as a ‘vintage’. For example, the first vintage for settled sales contracted in 2019 Q3 will be the October 2019 data release. This is the first monthly data release of settled sales data for the full quarter of 2019 Q3 (up to and including September 2019) that is available. In this example, 2019 Q3 is the ‘reference quarter.’

Based on separate pieces of analysis, almost all transactions are expected to be settled by the twelfth month. A check of the number of transactions at the 12-month mark shows that it is similar to the number of transactions at the 2-year mark. This suggests that the off-the-plan sales that are anticipated to settle only after two or three years (Shoory, 2016) do not make up a significant portion of the market. Therefore, we assume that the final vintage is released at least 12 months after the first vintage is released. For example, the final vintage for the 2019 Q3 reference quarter will be the October 2020 vintage.

To standardise the real-time data for analysis, vintages are described in terms of the number of months ahead relative to the reference quarter plus one month, e.g. the first vintage for the 2019 Q3 reference quarter is the October 2019 data release, which is 0 months ahead. Table 1 summarises the data vintages and number of months ahead in relation to each reference quarter.

In Victoria, there are approximately 10,000 residential transactions on average that settle each month. Figure 1 summarises the volume of transactions that are received over time for each reference quarter while Table 2 displays the share of transactions that are observed relative to final vintage, as an average over the reference quarters. For a given reference quarter, approximately 60 per cent of total settled property transactions present in the final vintage is observed in the first vintage, on average. By the end of the third month, most of the transactions had been received, which is consistent with the average settlement period from Canstar (2022).

Table 1: Data vintages and months ahead by reference quarter

0m1m2m3m4m5m6m7m8m9m10m11m12m
2019 Q3Oct 19Nov 19Dec 19Jan 20Feb 20Mar 20Apr 20May 20Jun 20Jul 20Aug 20Sep 20Oct 20
2019 Q4Jan 20Feb 20Mar 20Apr 20May 20Jun 20Jul 20Aug 20Sep 20Oct 20Nov 20Dec 20Jan 21
2020 Q1Apr 20May 20Jun 20Jul 20Aug 20Sep 20Oct 20Nov 20Dec 20Jan 21Feb 21Mar 21Apr 21
2020 Q2Jul 20Aug 20Sep 20Oct 20Nov 20Dec 20Jan 21Feb 21Mar 21Apr 21May 21Jun 21Jul 21
2020 Q3Oct 20Nov 20Dec 20Jan 21Feb 21Mar 21Apr 21May 21Jun 21Jul 21Aug 21Sep 21Oct 21
2020 Q4Jan 21Feb 21Mar 21Apr 21May 21Jun 21Jul 21Aug 21Sep 21Oct 21Nov 21Dec 21Jan 22
2021 Q1Apr 21May 21Jun 21Jul 21Aug 21Sep 21Oct 21Nov 21Dec 21Jan 22Feb 22Mar 22Apr 22
2021 Q2Jul 21Aug 21Sep 21Oct 21Nov 21Dec 21Jan 22Feb 22Mar 22Apr 22May 22Jun 22Jul 22
2021 Q3Oct 21Nov 21Dec 21Jan 22Feb 22Mar 22Apr 22May 22Jun 22Jul 22Aug 22Sep 22Oct 22
2021 Q4Jan 22Feb 22Mar 22Apr 22May 22Jun 22Jul 22Aug 22Sep 22Oct 22Nov 22Dec 22Jan 23
2022 Q1Apr 22May 22Jun 22Jul 22Aug 22Sep 22Oct 22Nov 22Dec 22Jan 23Feb 23Mar 23Apr 23
2022 Q2Jul 22Aug 22Sep 22Oct 22Nov 22Dec 22Jan 23Feb 23Mar 23Apr 23
2022 Q3Oct 22Nov 22Dec 22Jan 23Feb 23Mar 23Apr 23
2022 Q4Jan 23Feb 23Mar 23Apr 23
2023 Q1Apr 23

Figure 1: Volume of settled sales transactions across vintages

Figure 1 Volume of settled sales transactions across vintages

Table 2: Average share of transactions that are received relative to final vintage

Months aheadShare of total volume in data release 12 months laterMonths aheadShare of total volume in data release 12 months later
0m62%6m97%
1m79%7m98%
2m90%8m99%
3m95%9m99%
4m97%10m100%
5m98%11m, 12m100%

To estimate the impact of the settlement lag on a hedonic price index, we use the DTF hedonic price index described in Tan and Nasiri (2018). This HPI controls for a variety of factors such as location by postcode, number of bedrooms, bathrooms, car spaces, land size, floor area and age of structure. Using a fixed effects regression, we measure the impacts of each data release through the changes observed in the hedonic price index.

This DTF HPI is estimated for every vintage (i.e. from 0 months ahead, 1 month ahead, and so on to 12 months ahead) to simulate the HPI based on the available information set at each point in time. From here, the through-the-year growth rates are calculated to understand how they evolve across the data vintages. The difference between the HPI growth rate using early vintages relative to the HPI growth rate using final vintage data is calculated for each reference period. The differences are then averaged across reference periods to quantify the evolution to the final growth rate. For example, the 0 months ahead difference for the 2019 Q3, 2019 Q4, …, 2023 Q1 reference periods are averaged to determine the 0 months ahead average difference from the final vintage HPI growth rate. This is repeated for the 1 month ahead, 2 months ahead vintages and so on to the 11 months ahead data vintage. It should be noted that while vintages are released monthly, the HPI is estimated on a quarterly basis.

Updated