User’s Guide : Advanced Univariate Analysis : Univariate Time Series Analysis : Cross-sectionally Dependent Panel Unit Root Tests
  
Cross-sectionally Dependent Panel Unit Root Tests
 
Performing Dependent Panel Unit Root Tests in EViews
Test
ADF Lag Selection
PANIC MQ Options
Factor Selection
p-value Simulation
Dependent Panel Unit Root Test Examples
PANIC MQc Example
PANIC MQf Example
CIPS Example
Dependent Panel Unit Root Tests Background
PANIC Unit Root Testing
PANIC Unit Root Test Algorithm
Computing test data
Testing idiosyncratic components
Testing common factors
CIPS Test
In “Cross-sectionally Independent Panel Unit Root Testing”, we described first generation panel unit root tests on pooled panel data, with (possibly) individual trend, intercepts, and lag coefficients when there is cross-sectional independence.
The assumption of cross-sectional independence can be difficult to justify, as cross-sections are often influenced by common forces, termed factors. For instance, cross-country purchasing power parity (PPP) data is often anchored to a common numeraire currency such as the US dollar, so that movements in the latter affect the cross-sectional PPP in correlated fashion. Tests which account for cross-sectional dependence have been termed second generation panel unit root tests.
EViews currently supports two important second generation contributions: Bai and Ng’s (2004) Panel Analysis of Nonstationarity in Idiosyncratic and Common Components (PANIC), and Pesaran’s (2007) Cross-sectionally Augmented IPS (CIPS).
As with independent panel unit root tests, EViews supports testing in different settings involving multiple series: as a series view (if the workfile is panel structured), as a group view, or as a pool view.
Performing Dependent Panel Unit Root Tests in EViews
Second generation panel unit root tests may be performed on a single series in a panel workfile or on a group of series in a workfile. To perform the test in the panel setting, you should open the series and then click on View/Unit Root Tests/Cross-Sectionally Dependent... In the group setting, open the group and click on the same view entry. EViews will display the dialog:
There is a lot to this dialog so we will examine each section in turn.
Test
The Test section of the dialog is used to specify the basic test type:
The Type dropdown lets you choose between the two second generation panel unit root test types: Bai and Ng - PANIC and Pesaran - CIPS.
The Deterministics dropdown offers three choices for the deterministic terms to be included in the specification: None, Constant, or Constant and trend.
ADF Lag Selection
The ADF lag selection section controls the specification of the ADF test specifications in the PANIC and CIPS tests.
The Method dropdown offers several options for determining the number of lagged difference terms to use in the ADF specifications. You may use the dropdown to select between using information criteria (Akaike, Schwarz, Hannan-Quinn, Modified Akaike, Modified, Schwarz, Modified Hannan-Quinn), performing t-tests of lag difference variable significance, or providing a user-specified fixed value.
For the information criteria and t-test options, you will be prompted for the maximum number of lags to consider; for the user-specified fixed value, you will be prompted to enter the value itself.
Note that EViews pre-fills the Maximum lags edit field with a default value derived from a variation of the Schwert (1989) rule: let there be a lag choice function for and let ; then the default maximum lag .
PANIC MQ Options
If you choose to perform a PANIC test the PANIC MQ options section offers options for controlling the computation:
The Type dropdown lets you choose between the MQC and MQF statistics.
The Significance level edit field lets you choose the level of significance at which EViews will perform the test as the p-value for these statistics is obtained by simulation.
The remaining options will change with your selection of the PANIC type:
If you choose to compute the , the Long-Run Variance Options button will be enabled.
If you choose , the Lag selection options will be enabled so that you may control the order of the VAR(). The settings are as described in “ADF Lag Selection”.
Factor Selection
The Factor selection options control the selection of number of factors method used in the PANIC procedure:
The Method dropdown lets you choose between the Simple, Bai and Ng, and Ahn and Horenstein. In turn, each of these choices offers different options.
If you select Simple, you will choose a Criterion from the dropdown. You may choose from the following selections: Minimum of..., Maximum of..., Average of criteria.
For the first two selections, the number of factors is determined as the minimum or maximum of the Max. Factors (discussed below), the specified Minimum eigenvalue, and Cumulative proportion of eigenvalues. The Average of criteria will use the average of the other two results.
If you select Bai and Ng, EViews will compute the Bai and Ng (2002) as discussed in “Bai and Ng”.
The Criterion dropdown offers all of the information criteria described by Bai and Ng (ICP1, ICP2, ICP3, PCP1, PCP2, PCP3) as well as the Average of criteria which averages the criteria prior to determining the optimal number of factors.
If you select Ahn and Horenstein, EViews will compute the Ahn and Horenstein (2013) number of factor determination procedure ( “Ahn and Horenstein”). The dialog looks the same as for Bai and Ng, but the Criterion dropdown now offers Eigenvalue Ratio, Growth ratio, and Average of criteria.
For User, the number of factors is specified by the user in the edit box
In all of the automatic factor selection procedures above is the selection of maximum factors to be considered. While ultimately an arbitrary selection, EViews offers some suggestions typically seen in the literature. In particular, as in the case of ADF selection, consider the Max. factors dropdown which provides the following options:
Schwert: let there be a lag choice function for and let ; then returns the value .
Ahn and Horenstein: uses the suggestion made on page 1208 in Ahn and Horenstein (2013).
min(root(N), root(T)): uses .
min(N,T): uses .
User: users can specify an arbitrary value in the User maximum factors edit field.
Furthermore, as discussed in Bai and Ng (2002) and Ahn and Horenstein (2013), factor selection procedures can be improved dramatically if time and/or cross-section dimensions are demeaned and/or standardized. To allow for these possibilities, there are four checkboxes associated with each of these options and combinations:
Time-demean: demeans the time dimension.
Time-standardize: standardizes the time dimension.
Cross-demean: demeans the cross-section dimension.
Cross-standardize: standardizes the cross-section dimension.
p-value Simulation
In case of the PANIC tests, inference is based on simulated p-values. In this setting, you can may specify simulation options in the p-value simulations section.
In particular, the number of Monte Carlo replications, Asymptotic length and Random seed can be set in their respective edit fields. Note that the latter is optional. Furthermore, the Random generator dropdown offers various random number generators. Details associated with the latter options can be obtained in the EViews documentation for rndseed.
Dependent Panel Unit Root Test Examples
To illustrate some of the features of second generation panel unit root tests in EViews, we consider three examples using data from Pesaran (2007). These panel data feature quarterly real exchange rates from 17 OECD countries for the period 1973q1 to 1998q4. Data can be downloaded from
http://qed.econ.queensu.ca/jae/2007-v22.2/pesaran/
in the file “real-xrate.zip”.
You may unzip the file then drag-and-drop it onto your EViews to create the new workfile. As is, however, the resulting unstacked, undated workfile is not set up for panel or group unit root testing.
From the command line you can set up the group form of the unstacked data with the following command:
pagestruct(freq=q, start=1973q1)
group oecd_rer *_rate
The first line specifies the frequency of the data (the EViews auto-input was unable to do so since there are no dates in the file). The next line creates a group containing all of the rate series. We can conduct the unit root tests using the group object OECD_RER.
Alternately, we can choose to stack the data in a new page, and conduct the test on the stacked panel data. The command
pagestack ?real_exchange_rate @ ?*
creates a new page containing a stacked panel data series REAL_EXCHANGE_RATE.
Whichever form with which we wish to work, we will use the sample statement
smpl 1974 1998
to the example in Pesaran (2007).
Lastly, note that the examples below use the REAL_EXCHANGE_RATE series in the workfile tab “Realexchangerates7398_STK. All of the examples may also be performed using the group object OECD_RER in the workfile tab “Realexchangerates7398”.
PANIC MQc Example
This example will use default values for all inputs. To use the panel data form of the test, click on the workfile tab “Realexchangerates7398_STK”, and then open the panel series REAL_EXCHANGE_RATE. Click on View/Unit Root Tests/Cross-Sectionally Dependent..., to access the second generation panel unit root tests.
As PANIC is the default test, at this point, one can simply click on OK to compute this test.
The output displays a spool with the spool tree listing the summary, factor selection, common factors, and individual and pooled idiosyncratic elements. The first of these is a summary of the PANIC procedure performed. The second is a table listing the details of the factor selection procedure. In particular, since the selection was the average of the Bai and Ng (2002) test statistics, listed in the table are the factor selections based on each of the six individual statistic variants as well as their average which is used in the remainder of the PANIC procedure.
Next, the common factors table displays the PANIC test for how many common factors influence the panel. The test statistic and associated p-value are reported. In this particular case, that number selected 7, which is also the maximum number of factors allowed using the Schwert (1989) rule.
The table that follows displays the individual ADF test statistics for the idiosyncratic elements associated with each cross-section. The ADF lags are selected for each cross-section independently using the same automatic selection rule, which in this case happens to be AIC with 7 maximum lags. Furthermore, the columns t-stat and p-value report the ADF t-statistic and associated p-value corresponding to the null hypothesis that the idiosyncratic element exhibit a unit root. It is readily seen that apart from Norway, all other cross-sections exhibit a unit root at the 5% significance level.
Lastly, the final table is a pooled version of the individual ADF test statistics in the previous table. Here, we cannot reject the null hypothesis that all of the cross-sections are simultaneously co-integrated.
PANIC MQf Example
For the second example, using the same data, the objective is to perform a PANIC unit root test using the test statistic and the Ahn and Horenstein (2013) factor selection procedure.
From the series window click on View/Unit Root Tests/Cross-Sectionally Dependent.... In the PANIC MQ options section select MQF in the Type dropdown. Next, under the Factor selection group, select Ahn and Horenstein from the Method dropdown. Furthermore, for the Max. factors, select Ahn and Horenstein. Lastly, ensure that Time-demean, Time-standardize, Cross-demean, and Cross-standardize are all checked. Click on OK.
The spool output display is analogous to that in “PANIC MQc Example”. Here, the factor selection procedure is based on Ahn and Horenstein (2013).
Evidently, both the eigenvalue ratio and growth ratio statistics select a single common factor using the Ahn and Horenstein (2013) suggestion for maximum factors, which here happens to be 1.
Since the number of common factors selected is 1, the test associated with the common factors is based on an ADF regression. The next table displays the results of this test.
Here, we see that the test fails to reject the null hypothesis that the single common factor is non-stationary.
Next, as in the previous example, table that follows displays the individual ADF test statistics for the idiosyncratic elements associated with each cross-section. Unlike the previous example, there are significantly more cross-sections for which one can argue rejection of the unit root null hypothesis.
Lastly, the pooled version of the latter tests has a p-value which effectively zero, and therefore we fail to reject the null hypothesis that all cross-sections are not cointegrated.
CIPS Example
The final example uses the same data. Here, the objective is to perform the Pesaran (2007) CIPS test. From the series window click on View/Unit Root Tests/Cross-Sectionally Dependent.... From the Type dropdown, select Pesaran – CIPS. To conform with the analysis in the original paper, change the Method dropdown in the ADF lag selection group to User. We will stick with a single lag for the ADF regression. Hit OK.
The output displays a spool with the spool tree listing the summary, CIPS and CADF unit root tests, as well as CIPS and CADF critical value interpolations.
The first of the spool tables is a summary of the CIPS procedure performed. The second is a table listing the details of the CIPS test. In particular, the first part of this table displays the critical values for the usual CIPS statistic as well as its truncated version. The second portion of this table summarizes the test results. Note that the t-statistic is displayed along with the associated p-values which are summarized categorically based on the critical values tabulated in Pesaran (2007). Here, we cannot reject the null hypothesis below the 10% significance level. Furthermore, the outcome of this table matches the results reported in Table XI of the original paper.
The next table summarizes the ADF test statistics that are used in the computation of the CIPS statistic.
Here, the first portion of the table summarizes the critical values associated with the CADF statistic as well as its truncated version. The second portion of this table summarizes the test results for each of the cross-sections. In particular, these are t-statistics associated with the cross-sectionally augmented ADF regressions for each of the cross-sections. The table summarizes the t-statistic and p-value category for each of the CADF and truncated CADF test statistics. In particular, here we cannot reject the unit root null hypotheses at significance levels less than 10% for any of the cross-sections.
At last, the remaining two tables summarize the interpolation computation used to derive the critical values reported earlier.
In particular, if the T or N columns display values, it implies that interpolation was done on that dimension of the data.
Dependent Panel Unit Root Tests Background
To formalize the discussion, let and , respectively, denote the cross-section and time indices, and consider the following integrated panel data model:
(42.71)
where is the observed data, are the model deterministic dynamics, and are the innovations.
The traditional first generation panel unit root tests of Maddala and Wu (1999), Hadri (2000), Levin, Lin and Chu (2002), and Im, Pesaran and Shin (2003), while differing in unit root parameter heterogeneity assumptions and in test construction approach, all assume that are independent across i.
To introduce cross-sectional dependence, we model as having a factor model structure,
(42.72)
where is a vector of common factors generated by a multivariate white noise process ,
(42.73)
denotes a vector of cross-section specific factor loadings, and is a multivariate idiosyncratic component which is cross-sectionally independent as is assumed to account for all inter-cross-sectional correlations. When are not cross-sectionally independent, the factor model governing is said to be approximate. As usual, is the lag operator, and we have the polynomial:
(42.74)
Substituting, an integrated panel model with cross-sectional dependence may be formulated as
(42.75)
Assumptions imposed on and govern the nature of the process for . For example, when has rank 0, is non-stochastic and stationary, or . Alternately, when has rank , is an process driven by linearly independent stochastic trends. In particular, when , has common trends and cointegrated stationary factors.
A crucial feature of these models is that the common factors and idiosyncratic components can each be or . This generality permits a wide spectrum of possible outcomes for the properties of the observed . For instance, is considered if either or is . Alternately, if the number of common trends , is deemed conditionally stationary or non-stationary, depending on the properties of the idiosyncratic component .
For further detail, see Bai and Ng (2004) and Banerjee and Wagner (2009).
PANIC Unit Root Testing
Bai and Ng’s (2004) PANIC (Panel Analysis of Nonstationarity in Idiosyncratic and Common Components) test is widely considered to be the first unit root test for panel data with cross-sectional dependence. The PANIC test is based on a factor model in which non-stationarity can arise from common factors, idiosyncratic components, or both.
The observed series is modeled using Equation (42.75):
(42.76)
where
(42.77)
and is a mean zero, stationary, and invertible MA process. PANIC assumes a balanced sample where both and are large, such that as both .
Estimation requires a suitable stationary transformation of .
If contains at most a constant, we take the first differences of
(42.78)
If contains a linear trend, is both demeaned and differenced,
(42.79)
where
(42.80)
Then the transformed equation is
(42.81)
where and for and .
Before we turn to the computation of the test, note that an important advantage of PANIC is that the idiosyncratic components can be tested for stationarity regardless of the (non-)stationarity of the common factors. In other words, tests on common factors are asymptotically independent of tests on idiosyncratic components.
PANIC Unit Root Test Algorithm
The algorithm for computing the PANIC unit root test is quite involved and we encourage those interested to seek out more expansive treatments than that given here. Briefly, however, the algorithm involves three parts: computing the factor and idiosyncratic components, testing for a unit root in the idiosyncratic components, testing for a unit root in the common factors.
Computing test data
The first step in PANIC is computing estimates of the common factors and idiosyncratic components :
Arbitrarily specify the number of common factors, or use and to determine , the number of factors to use ( “Determining the Number of Components”).
Compute the principal components decomposition of the matrix, producing the matrix of eigenvectors .
Using , form a matrix of differenced common factors and run a regression of on to obtain the factor loadings matrix .
Compute estimates of the common factors and idiosyncratic components by taking cumulative sums of terms involving and .
Testing idiosyncratic components
Next, PANIC tests for a unit root in the idiosyncratic components:
Compute an ADF regression with lags on the for each cross-section and test for a unit root in the idiosyncratic component.
If we are unable to reject the null hypothesis for a given cross-section, we conclude that there is a unit root in the data for that cross section.
Recall too that early panel unit root tests conducted pooled tests to addressed the low power of univariate unit root tests in face of small . Similarly, we may compute a pooled test statistic based on Maddala and Wu (1999) and Choi (2001),
(42.82)
where is the p-value associated with the idiosyncratic component ADF test for the i-th cross-section.
Observe that although the individual ADF test statistics have a non-standard distribution characterized by functionals of Brownian motions, under cross-sectional independence, the pooled test converges to the standard Gaussian distribution. Furthermore, note that the null hypothesis of this test, that all cross-sections have a unit root, holds only if no stationary combination of the exists. As such, the pooled test is also, in fact, a panel test for no cointegration.
Bear in mind, however, that the pooled version of the idiosyncratic ADF test does require cross-sectional independence, even though the overall PANIC procedure allows for slight correlation between the idiosyncratic components.
Testing common factors
Lastly, PANIC tests for unit roots in the common factors:
If there is only one common factor (), PANIC computes an ADF regression with lags and tests for a unit root in the common factor.
In the multivariate common factor case () perform an iterative testing procedure for determining the number of common trends. This procedure is reminiscent of the Johansen (1991) trace test for cointegrating rank. Bai and Ng consider two test statistics:
1. The first, denoted as , is based on a VAR(1) representation and corrects for serial correlation of arbitrary form through a non-parametric estimation.
2. The second, denoted as , is based on a VAR() representation, and corrects for the presence of serial correlation through a filter procedure.
A brief description of the statistics is warranted here. In particular, these statistics are modified versions of similarly named statistics in Stock and Watson (1988) that were designed to test the null hypothesis that common factors have at most common stochastic trends, against the alternative that they have less than common trends. In this regard, the test adapts the Dickey and Fuller (1979) unit root test to the multivariate settings. On the other hand, the parallels in the multivariate the Phillips (1987) test.
In practice, for each statistic, the testing procedure first computes the relevant for and tests whether the number of common stochastic trends is equal to . If the null is not rejected, we conclude that all common factors are nonstationary. If the null is rejected, we decrement by and repeat until we fail to reject or until .
If fail to reject the null at some , we conclude that among the common factors, are nonstationary, and are stationary.
If we reject the null at , we conclude that all common factors are stationary.
CIPS Test
It is widely acknowledged that Bai and Ng (2004) is among the most general frameworks for panel unit root testing. In this regard, several specializations of PANIC exist and among the most popular and accessible is the Pesaran (2007) test (Cross-sectionally Augmented IPS).
The framework is again the factor augmented integrated panel,
(42.83)
but with the cross-sectional dependency restricted to a single stationary common factor ( and is ).
This model reduces to a simple heterogeneous dynamic linear model. When the deterministics include a trend and intercept we have
(42.84)
where are mean zero, stationary ARMA processes that are cross-sectionally and intra-sectionally independent. In addition to being , is assumed to be serially uncorrelated with mean zero and constant variance, and , , and are mutually independent for all i and t.
We may rewrite Equation (42.84) in Augmented Dickey-Fuller form as
(42.85)
where , the lagged differences are added to account for correlation in the , and represents factor augmentation.
It is important to highlight the fact that non-stationarity arises in this model only when the autoregressive coefficient is equal to 0. Accordingly the null hypothesis for the unit root test is
for all i,
against possibly heterogeneous alternatives,
for and for
such that for and is a positive integer, where, without loss of generality, we reorder the cross-sections to facilitate partitioning by index i.
For sufficiently large, Pesaran (2007) argues that where
(42.86)
are proxies for the common factor . It follows that and may be used to filter out the effects of the unobserved common factor.
Using these proxies we obtain the cross-sectionally augmented ADF (CADF) equation:
(42.87)
Let denote the t-statistic associated with the traditional ADF null hypothesis for cross-section i. Following Im, Pesaran, and Shin (2003), the panel unit root test of interest is a pooled version of individual CADF statistics, or the cross-sectionally augmented (CIPS) statistic:
(42.88)
A truncated version of this test is proposed to counter the influence of extreme outcomes that may arise when is sufficiently small, say . Define the constants and obtained by simulation in Pesaran (2007), and use these values to define a truncated version of ,
(42.89)
Neither CIPS nor truncated CIPS tests have standard limiting distributions. Critical values for popular scenarios have been derived via simulation and tabulated in Pesaran (2007).