Testing the neoclassical migration model: overall and age-group specific results for German regions

Mitze, Timo; Reinkowski, Janina

doi:10.1007/s12651-010-0046-2

Research Paper
Published: 24 December 2010

Testing the neoclassical migration model: overall and age-group specific results for German regions

Tests zur Validität des neoklassischen Migrationsmodells: Allgemeine und altersgruppenspezifische Resultate für deutsche Raumordnungsregionen

Timo Mitze^1,2 &
Janina Reinkowski³

Zeitschrift für ArbeitsmarktForschung volume 43, pages 277–297 (2011)Cite this article

1971 Accesses
13 Citations
Metrics details

Abstract

This paper tests the empirical validity of the neoclassical migration model in predicting German internal migration flows. We estimate static and dynamic migration functions for  97 Spatial Planning Regions between 1996 and 2006 using key labor market signals including income and unemployment differences among a broader set of explanatory variables. In addition to an aggregate specification we also estimate the model for age-group related subsamples. Our results give empirical support for the main transmission channels identified by the neoclassical framework – both at the aggregate level as well as for age-group specific estimates. Thereby, the impact of labor market signals is tested to be of greatest magnitude for workforce relevant age groups and especially young cohorts between the ages of  18 to  25 and  25 to  30. This latter result emphasizes the prominent role played by labor market conditions in determining internal migration rates of the working population in Germany.

Zusammenfassung

Dieses Papier untersucht die empirische Validität des neoklassischen Migrationsmodells zur Erklärung interner Migrationsströme in Deutschland. Die Schätzungen basieren auf Daten für  97 Raumordnungsregionen im Zeitraum 1996 bis 2006. Neben einer aggregierten Migrationsgleichung werden auch disaggregierte Modelle für verschiedene Altersgruppen geschätzt. Sowohl die aggregierten als auch altersgruppenspezifischen Resultate bestätigen den durch das neoklassische Migrationsmodell vorhergesagten Zusammenhang zwischen interner Migration und regionalen Arbeitsmarktungleichgewichten. Dabei zeigen die Ergebnisse, dass insbesondere junge Alterskohorten zwischen  18 und  25 sowie  25 und  30 Jahren stark auf Unterschiede im regionalen Lohnniveau und der Arbeitslosenquote reagieren.

1 Introduction

There are many theories aiming to explain why certain people migrate and others do not. However, the neoclassical model remains the standard workhorse specification for analyzing internal and external migration rates at regional, national and international levels. The model places special emphasis on the labor market dimension of migration and basically relates migration-induced population changes to the relative income (or wage) and employment situation found in the regions of origin and destination.

In its response, migration works as an equilibrating mechanism for balancing differences among regions with respect to key labor market variables since higher in-migration in a region is expected to reduce the regional wage level due to an increase in labor supply. From the perspective of economic policy making, the empirical implications of the neoclassical migration model are important in order to assess whether labor mobility can act as an appropriate adjustment mechanism in integrated labor markets facing asymmetric shocks. Though the neoclassical migration model is widely used as a policy simulation and didactic tool, international empirical evidence so far has provided rather mixed results.

In this paper, we therefore aim to check the validity of the neoclassical migration model using a panel of  97  German regions for the period 1996–2006. We are especially interested in taking a closer look at the role played by time dynamic adjustment processes driving the internal migration patterns. We also aim to identify the role of additional factors as well as regional amenities in explaining migratory movements and key labor market signals. Finally, we focus on the heterogeneity of the adjustment processes taking place when migration flows are disaggregated by age groups.

The remainder of the paper is organized as follows: Section  2 sketches the theoretical foundations of the neoclassical migration model. Building on its theoretical underpinnings, Sect.  3 discusses the estimation approach with a special focus on dynamic panel data models. Section  4 then presents a selected literature review for empirical studies dealing with the determinants of internal migration flows. Section  5 describes the data used and displays stylized facts for German internal migration flows and regional labor market trends. Section  6 presents the empirical results for the total sample as well as for different age groups. Apart from an economic interpretation of the estimation coefficients obtained, we also carefully look at likely model misspecifications such as cross-sectional dependence in the error terms. Section  7 concludes the paper.

2 The neoclassical migration model

Given the complex nature of the decision making process faced by individuals, there is a large variety of theoretical models available to explain the actual migration outcome. These models may either be classified as micro or macroeconomic in nature. Given the scope of this paper, in the following we focus on the latter class, which particularly addresses the labor market dimension of migratory flows. However, as for many macro relationships, the neoclassical migration model is also grounded on solid microeconomic foundations. Its derivation starts from a lifetime expected income (utility) maximization approach as specified in the classical work on the human capital model of migration (see Sjaastad 1962). The human capital model in fact views the process of migration as an investment decision, where the returns to migration in terms of higher wages associated with a new job should exceed the costs involved in moving.

Relaxing the assumption that prospective migrants have perfect information about the wage rates and job availabilities among all potential locations involved in their decision making process, Todaro (1969) proposed a model framework where migrants discount wages by the probability of finding a job in alternative regions. Throughout the decision making process, each individual compares the expected (rather than observed) income level he would obtain if were to stay in his home region  (i) with the expected income we would obtain in the alternative region  (j) and further accounts for ‘transportation costs’ of moving from region  i to  j.

Harris and Todaro (1970) further formalize this idea. The authors set up a model where the expected income from staying in the region of residence  $ { Y^{E}_{ii} } $ is a function of the wage rate or income in region $ { i\,\,(Y_{i}) } $ and the probability of being employed $ { (\text{Prob}(\text{EMP}_{i})) } $. The latter in turn is assumed to be a function of the unemployment rate in region $ { i\,\,(U_{i}) } $ and a set of further economic and non-economic determinants  $ { (X_{i}) } $. The same setup holds for region  j accordingly. Taking costs of moving from region  i to  j into account  $ { (C_{ij}) } $, the individual's decision will be in favor of moving to region  j if

$$ Y^{E}_{ii} \,{\textless}\, Y^{E}_{ij} - C_{ij} \,, {} $$

(1)

where $ { {Y^{E}_{ii}}\,{=}\,{f(\text{Prob}(\text{EMP}_{i}),}\,\,{Y_{i}}{)} } $ and $ { {Y^{E}_{ij}}\,{=}\,{f(\text{Prob}(\text{EMP}_{j}),}\break {Y_{j}}{)} } $. The potential migrant weights the proposed wage level in the home and target regions with the individual probability of finding employment. Using this information, we can set up a model for the regional net migration rate  $ { (\text{NM}_{ij}) } $ defined as regional in-migration flows to  i from  j relative to outmigration flows from  i to  j (possibly normalized by the regional population level), which has the following general form:

$$ \begin{aligned} \text{INM}_{ij} - \text{OUTM}_{ij} &= \text{NM}_{ij}\\ &= f(Y_{i}, Y_{j}, U_{i}, U_{j}, X_{i}, X_{j}, C_{ij}) \,. {} \end{aligned} $$

(2)

With respect to the theoretically motivated signs of the explanatory variables, the model predicts that an increase in the home region wage rate (or, alternatively, the real income level) ceteris paribus leads to higher net migration inflows, while a wage rate increase in region  j results in a decrease of the net migration rate. On the contrary, an increase in the unemployment rate in region  i  (j) has negative (positive) effects on the bilateral net migration from  i to  j. The costs of moving from  i to  j are typically expected to be an impediment to migration and are negatively correlated with net migration as:

$$ \begin{aligned} &\frac{\partial \text{NM}_{ij}}{\partial Y_{i}} \,{\textgreater}\, 0\,; \quad \frac{\partial \text{NM}_{ij}}{\partial Y_{j}} \,{\textless}\, 0\,; \quad \frac{\partial \text{NM}_{ij}}{\partial U_{i}} \,{\textgreater}\, 0\,;\\ &\frac{\partial \text{NM}_{ij}}{\partial U{j}} \,{\textless}\, 0\,; \quad \frac{\partial \text{NM}_{ij}}{\partial C_{ij}} \,{\textless}\, 0 \,. {} \end{aligned} $$

(3)

Core labor market variables may nevertheless not be sufficient to fully predict regional migration flows. We may extend the model by further driving forces of migration such as human capital, the regional competitiveness, housing prices, population density and environmental conditions, among others (see, e.g. Napolitano and Bonasia 2010, for an overview). For notational purposes, in the following we refer to the neoclassical migration model solely focusing on labor market conditions as the ‘baseline’ specification, while the ‘augmented’ specification also checks for regional amenities and further driving forces such as the regional skill level, population density and commuting flows as a substitute for migratory movements.

The likely impact of additional variables in the augmented neoclassical framework can be sketched as follows. Taking human capital as an example, it may be quite reasonable to relax the assumption of the Harris–Todaro model that an uneducated laborer has the same chance of getting a job as an educated laborer. Instead, the probability of finding a job is also a function of the (individual but also region specific) endowment with human capital (HK). The same logic holds for regional competitiveness (INTCOMP). Here, we expect that regions with a high competitiveness are better equipped to provide job opportunities than regions lagging behind (where regional competitiveness may, e.g. be proxied by the share of foreign turnover relative to total turnover in sectors with internationally tradable goods). For population density (POPDENS), we expect a positive impact of agglomeration forces on net flows through an increased possibility of finding a job, given the relevance of spillover effects, e.g. from a large pooled labor market. Thus, the probability of finding employment in region  i in the augmented neoclassical migration model takes the following form:^{Footnote 1}

$$ \begin{aligned} {\!\!\!\!}\text{Prob}(\text{EMP}_{i}) &= f[ U_{i}, \text{HK}_{i}, \text{INTCOMP}_{i}, \text{POPDENS}_{i} ]\,,\\ & {\quad\:\,} \mbox{with:} \quad {\!\!}\frac{\partial \text{NM}_{ij}}{\partial \text{HK}_{i}} \,{\textgreater}\, 0\,; \frac{\partial \text{NM}_{ij}}{\partial \text{INTCOMP}_{i}} \,{\textgreater}\, 0\,;\\[3pt] & {\quad\quad\quad\quad\:\:}{\!\!}\frac{\partial \text{NM}_{ij}}{\partial \text{POPDENS}_{i}} \,{\textgreater}\, 0 \,. \\[-24pt]\\[-24pt]\\[12pt]{} \end{aligned} $$

(4)

Moreover, we also carefully account for alternative adjustment mechanisms such as interregional net commuting flows to restore the inter-regional labor market equilibrium along with migratory movements. As Alecke and Untiedt (2001) point out, the theoretical as well as empirical literature with respect to interregional commuting (different from intraregional commuting) is rather scarce. According to Evers (1989), theoretical models of interregional commuting base the commuting decision on driving forces similar to those outlined in the migration framework. We thus expect that these flows are negatively correlated with net in-migration after controlling for common determinants such as regional income differences.

Finally, regional amenities are typically included as a proxy variable for (unobserved) specific climatic, ecological or socio-economic conditions in a certain region. According to the amenity approach regional differences in labor market signals then only exhibit an effect on migration after a critical threshold has been passed. Since in empirical terms it is often hard to operationalize amenity relevant factors, Greenwood et  al. (1991) proposed to test the latter effect by the inclusion (macro)regional dummy variables in the empirical model. For the long run net migration equation, amenity-rich regions should have dummy coefficients greater than zero, indicating that those regions exhibit higher than average in-migration rates, as would be expected after controlling for regional labor market and macroeconomic differences.

3 Econometric specification

3.1 Functional form of the empirical migration equation

For empirical estimation of the neoclassical migration model we start from its baseline specification as, e.g., applied by Puhani (2001) and set up a model for the net migration rate as:

$$ \left(\frac{\text{NM}_{ij,t}}{\text{POP}_{i,t-1}} \right) = A_{i,t} \left(\frac{U^{\alpha_{1}}_{i,t-1}Y^{\alpha_{2}}_{i,t-1}}{U^{\alpha_{3}}_{j,t-1}Y^{\alpha_{4}}_{j,t-1}}\right) \,, {} $$

(5)

where net migration rate between  i and  j is defined as regional net balance  NM for region  i relative to the rest of the country  j, POP is the region's  i population level,  t is the time dimension.^{Footnote 2}  A is a (cross-section specific) constant term. In the empirical literature, a log-linear stochastic form of the migration model in Eq.  (5) is typically chosen, where lower case variables denote logs and $ { {\textit{nmr}_{ij,t}}\,{=}\,{\log (\text{NM}_{ij,t}}{/}\break {\text{POP}_{i,t-1})} } $ as

$$ \begin{aligned} \textit{nmr}_{ij,t} & = \alpha_{0} + \alpha_{1} y_{i,t-1} + \alpha_{2} y_{j,t-1} \\ & \quad + \alpha_{3} u_{i,t-1} + \alpha_{4} u_{j,t-1} + \alpha_{5}\textbf{X} + e_{ij,t}\,, {} \end{aligned} $$

(6)

where  $ { e_{ij,t} } $ is the model's error term. Taking into account that migration flows typically show a degree of persistence over time, we augment Eq.  (6) by including one-period lagged values of net migration

$$ \begin{aligned} \textit{nmr}_{ij,t} & = \beta_{0} + \beta_{1} nmr_{ij,t-1} + \beta_{2} y_{i,t-1} + \beta_{3} y_{j,t-1} \\ & \quad + \beta_{4} u_{i,t-1} + \beta_{5} u_{j,t-1} + \beta_{6}\textbf{X} + e_{ij,t} \,. {} \end{aligned} $$

(7)

The inclusion of a lagged dependent variable can be motivated by the existence of social networks in determining internal migration flows over time. Rainer and Siedler (2009), for example, find for German micro data that the presence of family and friends is indeed an important predictor for migration flows in terms of communication links, which may result in a gradual adjustment process over time for migration flows out of a particular origin to a destination region.

To account for the role played by timely adjustment processes in the endogenous variable, in the context of panel data models specific estimation techniques based on instrumental variables have to be applied. Besides the problem arising from a dynamic model specification, these techniques, in combination with an appropriate lag selection for the further explanatory variables, it may also help to minimize the fundamental endogeneity problem in this model setup, which arises from a two-way causality between internal migration and regional labor market variables. We give a detailed discussion of the latter point throughout the outline of the applied estimation techniques in the following.

Finally, in applied work one typically finds a restricted version of Eq.  (7) where net migration is regressed against regional differences of explanatory variables of the form (see, e.g. Puhani 2001)

$$ \begin{aligned} \textit{nmr}_{ij,t} & = \gamma_{0} + \gamma_{1} \textit{nmr}_{ij,t-1} + \gamma_{2} \tilde{y}_{ij,t-1} + \gamma_{3} \tilde{u}_{ij,t-1} \\ & \quad\, + \gamma_{4} \textbf{X} + e_{ij,t} \,, {} \end{aligned} $$

(8)

where $ { \tilde{x}_{ij,t} } $ for a variable  $ { x_{ij,t} } $ denotes $ { {\tilde{x}_{ij,t}}\,{=}\,{x_{i,t}}\,{-}\,{x_{j,t}} } $. The latter specification implies the following testable restrictions

$$ \beta_{2} = -\beta_{3} \,, {} $$

(9)

$$ \beta_{4} = -\beta_{5} \,. {} $$

(10)

3.2 Choice of estimation technique and model misspecification tests

For estimation purposes we then have to find an appropriate estimator that is capable of handling the above described empirical setup. Given the dynamic nature of the neoclassical migration model in Eq.  (7), we can write the specified form in terms of a more general dynamic panel data model as (in log-linear specification):

$$ \begin{aligned} y_{i,t} & = \alpha_{0} + \alpha_{1} y_{i,t-1} + \sum_{j=0}^{k}\beta_{j}^\prime X_{i,t-j} + u_{i,t} \,,\\ & \quad\,\, \mbox{with:} \quad u_{i,t} = \mu_{i} + \nu_{i,t} \,, {} \end{aligned} $$

(11)

again $ { {i}\,{=}\,{1,}\,{\ldots,}\,{N} } $ (cross-sectional dimension) and $ { {t}\,{=}\break {1,}\,{\ldots,}\,{T} } $ (time dimension).  $ { y_{i,t} } $ is the endogenous variable and  $ { y_{i,t-1} } $ is one-period lagged value.  $ { X_{i} } $ is the vector of explanatory time-varying and time invariant regressors,  $ { u_{i,t} } $ is the combined error term, where  $ { u_{i,t} } $ is composed of the two error components  $ { \mu_{i} } $ as the unobservable individual effects and  $ { \nu_{i,t} } $ is the remaining error term. Both  $ { \mu_{i} } $ and  $ { \nu_{i,t} } $ are assumed to be i.i.d. residuals with standard normality assumptions.

There are numerous contributions in the recent literature on how to estimate a dynamic model of the above type, which especially deal with the problem introduced by the inclusion of a lagged dependent variable in the estimation equation and its built-in correlation with the individual effect; that is, sincet  $ { y_{it} } $ is a function of  $ { \mu_{i} } $, also  $ { y_{i,t-1} } $ is a function of  $ { \mu_{i} } $ and thus  $ { y_{i,t-1} } $ as right-hand side regressor in Eq.  (11) is likewise correlated with the combined error term. Even in the absence of serial correlation of  $ { \nu_{it} } $ this renders standard  $ { \lambda } $-class estimators such as OLS, the fixed effects model (FEM) and the random effects model (REM) inconsistent (see, e.g. Nickell 1981, Sevestre and Trogon 1995 or Baltagi 2008, for an overview).

Next to direct approaches aiming to correct for the bias of the FEM (see, e.g. Kiviet 1995, Everaert and Pozzi 2007, and the related literature for analytical or bootstrapping-based correction factors), the most widely applied approaches of dealing with this kind of endogeneity typically applies instrumental variable (IV) and generalized methods of moments (GMM) based techniques. While the first generation of models used transformations in first differences, latter extensions also account for the information in levels, when setting up proper estimators. A common tool is the system GMM estimator by Blundell and Bond (1998) as weighted average of first difference and level GMM.

Especially the latter estimators are a good candidate to simultaneously handle the problem arising from the inclusion of the lagged migration variable in our empirical model and the fundamental endogeneity problem induced by two-way causality between migration and labor market variables. In our case, the combination of an appropriate lag selection for the right-hand side regressors combined with the IV approach may do so. That is, since we include labor market variables with a lag structure in Eq.  (7), by definition there cannot be any direct feedback effect from  $ { \textit{nmr}_{ij,t} } $ to labor market variables. However, since  $ { \textit{nmr}_{ij,t-1} } $ enters contemporaneously with respect to the latter, there is still the risk of two-way interdependencies due to the dynamic setting of the model. We minimize these potential risks of any endogeneity bias by instrumenting  $ { \textit{nmr}_{ij,t-1} } $ with its lagged values so that the possibility of feedback effects from migration responses to labor market changes as source of estimation bias is limited. This should lead to consistent estimates of the coefficients for the explanatory variables.^{Footnote 3}

We are then also particularly interested in testing for the appropriateness of the chosen IV approach and apply test routines that account for the problem of many and/or weak instruments in the regression (see, e.g. Roodman 2009). Moreover, as it is typically the case with regional data, we are especially aware of the potential bias induced by a significant cross-sectional dependence in the error term of the model. There are different ways to account for such error cross-sectional dependences implying $ { {\text{Cov}(\nu_{i,t}\nu_{j,t})}\,{\neq}\,{0} } $ for some  t and $ { {i}\,{\neq}\,{j} } $ (see Sarafidis and Wansbeek 2010).

Besides the familiar spatial econometric approach, which assumes certain distance decay in spatial dependence, recently the common factor structure approach has gained considerable attention. The latter specification assumes that the disturbance term contains a finite number of unobserved factors that influence each individual cross-section separately. The common factor model approach is based on the concept of strong cross-sectional dependence, which assumes that all regions, either symmetrically or asymmetrically, are affected rather than just those nearby. Common examples are, for instance, regional adjustment processes to common macroeconomic shocks. We introduce a common factor structure for the error term according to Eq.  (11) in the following way:

$$ u_{i,t} = \mu_{i} + \nu_{i,t} \,, \quad \nu_{i,t} = \sum_{m=1}^{M}{\phi_{m,i}\textbf{f}_{m,t}} + \epsilon_{i,t} \,, {} $$

(12)

where $ { {\textbf{f}_{m,t}}\,{=}\,{(}{f_{1,t,}}\,{\ldots,}\,{f_{M,t}}{)}{^\prime} } $ denotes an $ { {M}\,{\times}\,1 } $ vector of individual-invariant time-specific unobserved effects, $ { {\phi_{i}}\,{=}\,{(\phi_{1,i},}\,{\ldots,}\,{\phi_{M,i})^\prime} } $ is an $ { {{M}\,{\times}\,1} } $ vector of factor loadings and  $ { \epsilon_{i,t} } $ is a pure idiosyncratic error component with zero mean and constant variance. Cross-sectional dependence in turn leads to inconsistent estimates if regressors are correlated with the unspecified common variables or shocks. There are different proposals in the literature on how to account for unobserved factors.

For dynamic panel estimators with short time dimension, Sarafidis and Robertson (2009) propose applying time-specific demeaning, which alleviates the problem of parameter bias if the variance of the individual factor loadings for the common factor models is small. Alternatively, if the impact of the common factor varies considerably by cross-sections, there are different estimation techniques that account for this type of cross-sectional dependence by using cross-section averages of the dependent and independent variables as additional regressors (see, e.g. Pesaran 2006).

Recently, various testing procedures have been developed to check for the presence of cross-sectional dependence. Among the most commonly applied routines is Pesaran's (2007) extension to the standard Breusch and Pagan LM  test. The so-called Cross-Section Dependence (CD) test is based on the pairwise correlation coefficient of residuals from a model specification that ignores the potential presence of cross-sectional dependence. However, as Sarafidis and Wansbeek (2010) point out, the CD-Test has the weakness that it may lack power to detect the alternative hypothesis under which the sign of the elements of the error covariance matrix alternates (thus for positive and negative correlation in the residuals, e.g. for factor models with zero mean factor loadings).

Moreover, the test statistic requires normality of the residuals. Sarafidis et  al. (2009) propose an alternative testing procedure that does not require normality and is valid for fixed  T and large  N. The testing approach, which is designed for the Arellano and Bond (1991) and Blundell and Bond (1998) GMM estimators, is based on the Diff-in-Hansen test for overidentifying restrictions. The latter is also known as the  C-statistic and is defined according to Eichenbaum et  al. (1988) as the difference between two Sargan (1958)/Hansen (1982)  J-statistics for an unrestricted and restricted IV/GMM-model. The aim of the test is to examine whether there is still (heterogeneous) cross-sectional dependence in the residuals after time-specific demeaning in the logic of Sarafidis and Robertson (2009). The test has the following form:

$$ C_{\text{CD-GMM}} = (S_{\text{F}} - S_{\text{R}}) \stackrel{d}{\rightarrow} \chi^{2}_{h_{\text{d}}} \,, {} $$

(13)

where  $ { h_{\text{d}} } $ is the number of degrees of freedom of the test statistic as difference between the set of instruments (number of moment conditions) in the full model  $ { (S_{\text{F}}) } $ and the restricted model  $ { (S_{\text{R}}) } $, where the GMM model has either the Arellano–Bond or the Blundell–Bond form augmented by time-specific dummy variables. The corresponding null hypothesis of the Sargan's difference-test tests is that there is homogeneous cross-sectional dependence in the model versus the alternative of heterogeneous cross-sectional dependence. If only homogeneous cross-sectional dependence is present, the inclusion of time-specific dummies variables is sufficient to remove any bias in the estimation approach, see, e.g. Sarafidis and Robertson (2009).^{Footnote 4}

4 What does the empirical literature say?

Testing for the empirical validity of the neoclassical migration model yields rather mixed results, when looking at recent empirical evidence for European data. Here, regional (un)employment disparities are often shown to be important factors in determining migratory flows. On the contrary, the influence of regional wage or income levels is difficult to prove in many empirical examinations (see, e.g. Pissarides and McMaster 1990, as well as Jackman and Savouri 1992, for British regions; Westerlund 1997, for inter-regional migration in Sweden; Devillanova and Garcia-Fontes 2004, for Spain). For the Italian case, Daveri and Faini (1999) show that the regional wage level corresponds to the theoretically expected signal for the gross outward migration from southern to northern regions. Similar results are found in Fachin (2007).

Napolitano and Bonasia (2010) show that although the coefficients for Italian labor market variables in the neoclassical migration model show the expected sign, due to the complexity of the internal migration process, the baseline Harris–Todaro approach neglects important variables such as agglomeration forces measured by population density and human capital. The latter variables are also found to be significant in addition to the standard labor market variables in an inter-regional migration model for the Polish transition process (see Ghatak et  al. 2008). This indicates that the augmented migration model may be in order.

Turning to the case of German interregional migration, Decressin (1994) examined gross migration flows for West German states up to 1988. His results show that a wage increase in one region relative to others causes a disproportional rise in the gross migration levels in the first region. On the other hand, a rise in the unemployment in a region relative to others disproportionally lowers the gross migration levels. Decressin does not find a significant connection between bilateral gross migration and regional differences in wage level or unemployment when purely cross-sectional estimates are considered.

Difficulties in proving a significant influence of regional wage decreases on the migratory behavior within Germany are also found in earlier empirical studies based on micro-data directly addressing the motivation for individual migratory behavior in Germany. Among these are Hatzius (1994) for the West German states, and Schwarze and Wagner (1992), Wagner (1992), Burda (1993) and Büchel and Schwarze (1994) for East Germany. Subsequent studies succeed in qualifying the theoretically unsatisfactory result of an insignificant wage influence. Schwarze (1996) shows that by using the expected wage variables instead of the actual ones, the wage drop between East German and West German states has a significant influence on the migratory behavior.^{Footnote 5} In a continuation of the work (Burda 1993), Burda et  al. (1998) also indicate a significant non-linear influence on household income.

Contrary to earlier evidence, in recent macroeconomic studies with an explicit focus on intra-German East–West migration flows, regional wage rate differentials are broadly tested to significantly affect migration flows (see, e.g. Parikh and Van Leuvensteijn 2003, Burda and Hunt 2001, Hunt 2006, as well as Alecke et  al. 2010). The study of Parikh and Van Leuvensteijn (2003) augments the core migration model with regional wage and unemployment differentials as driving forces of interregional migration by various indicators such as regional housing costs, geographical distance and inequality measures. For the sample period 1993 to 1995, the authors find a significant non-linear relationship between disaggregated regional wage rate differences and East–West migration (of a U-shaped form for white-collar workers and of inverted U-form for blue-collar workers), while unemployment differences showed to be insignificant. The relationship between income inequality and migration did not turn out to be strong.

According Burda and Hunt (2001), wage rate differentials and especially the fast East–West convergence are also a significant indicator in explaining the state-to-state migration patterns observed. Using data from 1991 to 1999, the authors find that the decline in East–West migration starting from 1992 onwards can almost exclusively be explained by wage differentials and the fast East–West wage convergence, while unemployment differences do not seem to play an important part in explaining actual migration trends. The study by Hunt (2006) comes closest to the research focus in this paper. Hunt (2006) also estimates the migration response to labor market signals by age groups. The author finds that young potential emigrants are more sensitive to wages than older cohorts. At the same time young age groups are found to be less sensitive to unemployment levels in the origin region. Hunt (2006) argues that the latter finding is likely to drive the migration pattern pooled over all age groups and thus gives a motivation for the dominance of wage rate signals in aggregate data as, e.g., reported by Burda and Hunt (2001).

Alecke et  al. (2010) apply a Panel VAR to analyze the simultaneous impact of labor market variables to migration and vice versa for German federal states between 1991 and 2006. The results broadly support the neoclassical migration model and show that migration itself has an equilibrating effect on labor market differences. The authors also find evidence for structural differences between the German west and east macro regions in the migration equation, which is similar to findings for an Italian ‘empirical puzzle’ with a distinct North–South division in terms of the magnitude of migration responses to labor market signals (see, e.g. Fachin 2007, and Etzo 2007).

The recent results for Germany also show that the specific time period used for estimation may have significant impact on the estimation results. Especially for the first years after the reunification several structural breaks are in order that may partly explain the results between earlier and recent contributions with respect to German internal migration. However, except for Alecke et  al. (2010), none of the empirical papers take into account recent sample observations incorporating information about the second wave of strong East–West outmigration around the year 2001. The allocation of higher weights to recent sample observations may in turn minimize the risk of biasing the results in the light of distinct macro regional structural breaks.^{Footnote 6}

5 Data and stylized facts

We use the heterogeneous findings in the international and German empirical literature regarding the neoclassical migration model as a starting point for an updated regression approach based on German spatial planning units between 1996 and 2006. For empirical estimation we use regional data for the  97 German Spatial Planning Regions (so called Raumordnungsregionen) as the level of analysis for spatial migration processes within Germany (see, e.g. Bundesinstitut für Bau-, Stadt-, und Raumforschung 2010, for details about the concept of Spatial Planning Regions).^{Footnote 7}

We use a set of variables comprising regional net migration, population, real income, unemployment rate, human capital endowment, international competitiveness of regions and commuting flows. The latter have been included to account for an alternative adjustment mechanism to balance labor market disequilibria. Human capital is defined as the percentage share of regional employment of people with a university degree (including universities of applied science) in the total employment covered by the social security system (sozialversicherungspflichtig Beschäftigte).^{Footnote 8} We also include two sets of dummy variables. 1.  Binary dummy variables for the  16 federal states to capture macro regional differences (see, e.g. Suedekum 2004). This may be especially important to account for structural differences between West and East Germany (see, e.g. Alecke et  al. 2010, for recent findings). 2.  Binary dummy variables for different regional settlement types ranging from metropolitan agglomerations to rural areas (in total  7 different categories based on their absolute population size and population density). As Napolitano and Bonasia (2010) point out, variables measuring population density may be an important factor in explaining the regional amenities. Variable definitions and descriptive statistics are provided in Tables  1 to  3.

Table 1 Variable definition and data sources

Full size table

Table 2 Descriptive statistics for continuous variables in the sample

Full size table

Table 3 Descriptive statistics for binary variables in the sample

Full size table

To highlight regional and macro regional differences for net migration and explanatory variables, Fig.  1 visualizes spatial differences for the sample means of net in-migration and labor market variables for the period 1996–2006. Net in-migration flows are categorized into labor force relevant age groups between  18 and  65 years as well as non-labor force relevant age groups. For labor force migration, the figure shows that throughout the sample period the East German regions on average lost a considerable fraction of their population levels through net out-migration. Exceptions are the economic core regions around Berlin/Brandenburg and in the south-west of Saxony. Also, the western regions along the border to East Germany experienced net outflows. On the other hand, the northern West German regions around the urban agglomerations Hamburg and Bremen are among the net recipient regions as well as the western agglomerated regions in the Rhineland (around the metropolitan areas Cologne and Düsseldorf) and the southern West German regions in Baden Württemberg and Bavaria.

Looking at net migration trends for non-labor market relevant age groups, the picture is less clear. We can see from Fig.  1 that both the north German coastal regions as well as the southern border regions gain considerable population through net in-migration. This trend may be interpreted in terms of regional amenities such as topographical advantages, which attract migration flows. The relative difference is especially observable for the East German coastal zone in Mecklenburg-Vorpommern. The spatial distribution of real per capita income and unemployment rates nevertheless show a distinct West–East division. The regions with the highest income levels for the sample period are the northern regions around Hamburg, the Western regions in the Rhineland as well as large parts of the southern states Baden-Württemberg and Bavaria. Since these regions were also found to have large net in-migration flows (both overall as well as for the workforce relevant age groups), this may give a first hint of the positive correlation of migration flows and regional income levels as suggested by the neoclassical migration model. The opposite case is supposed to hold for large regional unemployment rates. Especially for the East German Spatial Planning Regions high unemployment rates seem to match with net population losses. To check the correlation of these variables more in depth, the next section presents the results of the estimation exercise.

6 Empirical results for the neoclassical migration model

6.1 Aggregate findings

For the migration model of Eqs.  (7) and  (8) we apply different static and dynamic panel data estimators. Before estimating the empirical migration model we check the time series properties of the variables involved in order to avoid the risk of running a spurious regression for non-stationary variables (with moderate $ { {T}\,{=}\,{11} } $). We therefore report test results of different panel unit root tests including recently proposed methods by Levin et  al. (2003) and Im et  al. (2003), as well as Pesaran's (2007) CADF test. The latter approach has the advantage that it is relatively robust with respect to cross-sectional dependence in the variable, even if the autoregressive parameter is high (see, e.g. Baltagi et  al. 2007, as well as de Silva et  al. 2009, for extensive Monte Carlo simulation evidence). As the results in Table  4 show, for almost exclusively all variables and test specifications the null hypothesis of non-stationarity of the series under observation can be rejected.^{Footnote 9} Given this overall picture of the panel unit root tests together with the theoretically motivated assumption that migration flows are transitory processes between two labor market equilibria, it seems reasonable to handle the variables as stationary processes so that we can run untransformed regressions without running the risk of spurious regression results.

Table 4 Results of panel unit root tests (p-values) for variables in the migration model

Full size table

For estimation we start from an unrestricted presentation of the baseline model including the core labor market variables real income  (y) and unemployment rates  (u) and test for parameter constraints according to Eqs.  (9) and  (10). As the results in Table  5 show, for almost all model specifications the null hypothesis for equal parameter size cannot be rejected on the basis of standard Wald tests. Also, compared to the static specification in column  2, the relative root mean squared error (RMSE) criterion of the model strongly increases if we add a dynamic component to the migration equation. The relative RSME for each estimator is thereby computed as the ratio of the model's RMSE and the static POLS benchmark specification in column  1. A value smaller than one indicate that the model has a better predictive performance than the benchmark POLS.

Table 5 Baseline specifications of the neoclassical migration model for German spatial planning regions

Full size table

As discussed above the  $ { \lambda } $-class estimators are potentially biased in a dynamic specification. Since the coefficient of the lagged dependent variable turns out to be highly significant, we also compute a bias-corrected FEM specification as well as the Arellano and Bond (1991) and Blundell and Bond (1998) system GMM estimators. According to the relative RMSE criterion the Blundell–Bond system GMM specification has the smallest prediction error. The coefficients for labor market signals are statistically significant and of the expected signs. Moreover, the SYS-GMM specification passes standard tests for autocorrelation in the residuals  ($ { m_{1} } $ and  $ { m_{2} } $ statistics proposed by Arellano and Bond 1991) as well as the Hansen  J-statistic for instrument validity. The reported  C-statistic for the exogeneity of the instruments in the level equation shows the validity of the augmented approach in extension to the standard Arellano–Bond first difference model.

We then use the SYS-GMM approach to test for the significance of different extensions of the baseline Harris–Todaro model. We start by including a dummy variable for the East German Spatial Planning Regions (see Table  6). The motivation for this approach is to test for the significance of the so-called East German empirical puzzle, where a relatively high degree of migratory interregional immobility was found to coexist with large regional labor market disparities. Fachin (2007) and Etzo (2007) report similar results for Italian South–North migration trends, while Alecke and Untiedt (2000) as well as Alecke et  al. (2010) identify such effects for German East–West migration throughout the 1990s.^{Footnote 10}

Table 6 Augmented neoclassical migration model for German spatial planning regions

Full size table

The results in Table  6 for the period 1996 to 2006 report a statistically significant positive East German dummy that indicates higher net in-migration balances for the East German spatial planning regions than their labor market performance would suggest. To obtain further insights we also estimate a specification that includes federal state level fixed effects. The estimation results for the state dummies in the baseline model are shown in Fig.  2. As the figure highlights, for all six East German state dummies we obtain statistically significant and positive coefficients. Negative coefficients are found for the West German states Baden Württemberg, Bavaria and Hessen. A Wald test for the joint effect of the set of state dummies turns out to be highly significant. However, most importantly, for both models including the East German dummy and the set of state dummies, the impact of labor market variables is still of the expected sign and higher than in the baseline specification. In line with Suedekum (2004) for West Germany, the results thus show that macro regional differences matter, yet there are no qualitative effects on the estimated coefficients that hint to a systematic rejection of the neoclassical migration model.

Regarding further variables in the augmented variable set, the results show that higher interregional net in-commuting levels are negatively correlated with the net in-migration rate. This supports our basic theoretical expectations above that both types are alternative adjustment mechanisms to reduce labor market disparities. The binary dummy variables for different settlement types (classified by size of local urban centers and population density, see Table  1 for details) reveal further structural differences in inter-regional migration patterns. Next to rural areas with low population density, agglomeration regions of Types  2 and  4 also show significantly lower net in-migration rates relative to benchmark category Type  1 (highly agglomerated area with a regional urban center above 100,000 persons and population density above 300 inhabitants$ { / } $sqm). This may hint at the role played by regional centers of agglomeration in attracting migration flows and may be interpreted in favor of a ‘re-urbanization’ process in Germany for the period 1996 to 2006. Similar trends have also been reported by Swiaczny et  al. (2008).^{Footnote 11}

Finally, testing for the effects of regional human capital endowments and international competitiveness shows mixed results. While the proxy for the latter variable in terms of foreign turnover relative to total turnover in manufacturing sector industries shows the expected positive effect on net in-migration, the regional endowment with human capital is found to be insignificant. This finding corresponds to recent results for Spain between 1995–2002, where regional differences in human capital were not found to be helpful in predicting internal migration flows (see Maza and Villaverde 2004). The latter may be explained by the fact that not the region's specific stock of human capital but rather the individual endowment of the prospective migrant is the appropriate level of measurement. However, the latter variable is not observable for regional data.

In order to check the appropriateness of our augmented SYS-GMM specifications, we perform a variety of postestimation tests for instrument appropriateness and temporal and cross-sectional dependence of the error term. The test results are reported in Table  6. With respect to IV appropriateness and temporal autocorrelation of the error terms, all model specifications show satisfactory results. In order to control cross-sectional error dependence due to unobserved common factors, we first add year dummies to our model specification, which also turn out to be jointly significant. We then apply Sargan's difference test for the SYS-GMM model $ { (C_{\text{CD-GMM}}) } $ as described above, in order to check for the nature of the cross-sectional dependence given the impact of unobserved common factors.

In order to run the test, we first need to judge whether the set of explanatory variables (excluding instruments for the lagged endogenous variable) is exogenous with respect to the combined error term. This can easily be tested by means of a Sargan/Hansen  J-statistic based overidentification test. As the results in Table  6 show, only those model specification that include fixed state effects pass the overidentification test for the vector of explanatory variables. For these equations we can then apply $ { C_{\text{CD-GMM}} } $ from Eq.  (13) in order to test for the existence of heterogeneous factor loadings for the common factor structure of the error terms as proposed by Sarafidis et  al. (2009). The test results do not indicate any sign of misspecification, including period-fixed effects for standard significance levels, hinting at homogeneous responses to common shocks. In sum, the augmented neoclassical migration equation is shown to be an appropriate representation of the data generating process and highlights the role of key labor market variables in explaining net in-migration rates in German regions.

6.2 Disaggregate estimates by age groups

Given the supportive findings for the neoclassical migration model at the aggregate level, we finally aim to check the sensitivity of the results when different disaggregated age groups are used. We are especially interested in analyzing whether the estimated coefficients for the labor market signals change for different age groups. Indeed, the estimation results show that the migratory response to labor market variables is much higher for workforce relevant age groups. For both the baseline and augmented model, the resulting coefficients for real income and unemployment rate differences together with 95% confidence intervals are plotted in Fig.  3.^{Footnote 12}

The coefficient for real income differences in Fig.  3 shows a clear inverted U-shaped pattern when plotted for the different age groups in ascending order. While for migrants aged up to  18 years real income differences do not seem to matter, for migrants aged between  18 to  25 and  25 to  30 years the estimated coefficient is statistically significant and much higher compared to the overall migration equation in Table  6. For older age groups the effect reduces gradually. The migration responses are found to be very similar for the baseline and augmented migration specification (see Fig.  3). Similar results were found for regional unemployment rate differences, which are shown to be almost equally important for age groups up to  50. It is only for elderly age groups that the coefficients turn out to be of smaller size and partly insignificant. If we look at the distribution of the state-level fixed effects for each estimated age-group specification, the estimation results show that the positive dummy variable coefficients for the East German states particularly hold for the workforce relevant age groups. The results are graphically shown in Fig.  4 for the baseline migration model.

Finally, Table  7 computes the ‘relative importance’ of the labor market variables by age groups in determining net migration flows. Thereby, the relative importance refers to the quantification of an individual regressor's contribution in a multiple regression model (see, e.g. Grömping 2006, for an overview). This allows us to further answer the question as to how far our estimation results support the prominent role of labor market conditions in guiding internal migration rates (of the workforce population) in Germany. Table  7 computes two specifications based on the squared correlation of the respective regressor with the dependent variables (univariate  $ { R^{2} } $, specification  A) as well as the standardized estimated SYS-GMM coefficients from the augmented migration model. This latter metric for assessing the relative importance of regressors has an advantage over the simple benchmark in specification  A since it accounts for the correlation of regressors. As the table shows, both methods assign a significant explanatory share to the two key labor market variables in predicting migration flows, especially for the workforce population (up to  50% joint contribution in Specification  A for age group  18 to  25 years and even up to  65% for age group  25 to  30 in Specification  B). The SYS-GMM thereby on average assigns a stronger weight to real income differences in explaining net in-migration relative to unemployment differences. However, the overall picture confirms our interpretation of the regression tables in assigning a prominent role to labor market imbalances in driving German internal migration.

Table 7 Relative contribution of labor market variables in explaining migration flows

Full size table

7 Conclusion

In this paper, we have analyzed the explanatory power of the neoclassical migration model to describe aggregate and age-group specific internal migration trends for  97 German Spatial Planning regions throughout the period 1996–2006. Our results are based on model specifications for dynamic panel data estimators and give strong evidence in favor of the neoclassical inspired Harris–Todaro model. Both real income differences as well as unemployment rate disparities are found to be statistically significant with the expected signs. That is, a real income increase in region  i relative to region  j leads to higher net migration inflows to  i from  j; on the contrary, a rise in the regional unemployment rate in  i leads to lower net inflows. Given these responses to labor market signals, migration flows may be seen as a spatial adjustment mechanism and equilibrate regional labor market imbalances.

The results of the standard neoclassical migration model remain stable if commuting flows, regional human capital endowment, the region's international competitiveness as well as differences in the settlement structure are added as further explanatory variables. The inclusion of the regional net in-commuting rate shows a negative correlation with migration underlying the substitutive nature of the two variables. Also, an increasing level of international competitiveness attracts further in-migration flows. We also find heterogeneity for different types of regional settlement structure proxied by population density and we observe persistent structural differences for the two East–West macro regions (by including individual federal state level fixed effects or a combined East German dummy). Most importantly, the impact of core labor market variables is still of the expected sign when further variables are added. In line with earlier empirical studies, the results thus show that macro regional differences matter, yet there are no qualitative effects on the estimated coefficients that hint to a systematic rejection of the neoclassical migration model.

We finally estimate the migration model for age-group specific subsamples of the data. Here, the impact of labor market signals is found to be of greatest magnitude for workforce relevant age groups (18 to 25,  25 to  30 and  30 to  50 years). Computing the ‘relative importance’ of labor market variables by age groups in a multiple regression framework with a broader set of controls, our results show that for young cohorts up to  65% of all migratory movements can be explained by differences in regional income levels and unemployment rates. This latter result emphasizes the prominent role played by labor market conditions in guiding internal migration rates of the working age population in Germany.

Executive summary

Given that the degree of geographical mobility of the labor force is an important indicator of the conduct of labor market policies, this paper tests the empirical validity of the neoclassical migration model in predicting German internal migration flows. The neoclassical migration model places special emphasis on the labor market dimension of migration and basically relates migration-induced population changes to the relative income (or wage) and employment situation found in the regions of origin and destination, respectively. Thereby, regions with a higher relative income level and lower unemployment rates are expected to exhibit positive net in-migration rates. In its response to regional labor market disparities, migration itself should work as an equilibrating mechanism for balancing these labor market differences. Higher in-migration flows in a certain region are expected to reduce the regional wage level due to an increase in labor supply and at the same time increase the regional unemployment rate. Thus, from the perspective of economic policy making, the empirical implications of the neoclassical migration model are important, for instance, to assess whether labor mobility can act as an appropriate adjustment mechanism in integrated labor markets facing asymmetric shocks. Though the neoclassical migration model is widely used as a policy simulation and didactic tool, the international empirical evidence so far has provided rather mixed results. To shed more light on the German case, we estimate static and dynamic migration functions for  97 Spatial Planning Regions between 1996 and 2006 using key labor market signals, including income and unemployment differences among a broader set of explanatory variables, including human capital, regional competitiveness, regional settlement structure and interregional commuting as alternative adjustment process for labor market disequilibria. In addition to an aggregate specification, we also estimate the model for age-group related subsamples. We estimate the migration equation by means of dynamic panel data estimation using instrumental variable techniques and place special emphasis on postestimation testing in order to control for right-hand side endogeneity and cross-sectional dependence of the error term.

Our results give empirical support for the main transmission channels identified by the neoclassical framework, both at the aggregate level as well as for age-group specific estimates. At the aggregate level, real income differences and unemployment rate disparities are found to be statistically significant with the expected signs. The results remain stable if commuting flows, regional human capital endowments, the region's international competitiveness as well as differences in the settlement structure are added as further explanatory variables. The inclusion of the regional net in-commuting rate shows a negative correlation with interregional migration, underlying the substitutive nature of the two variables. Further, an increasing level of international competitiveness attracts additional in-migration flows. We also find heterogeneous migration patterns for different types of regional settlement structure proxied by population density and we observe persistent structural differences for the two East–West macro regions (by including individual federal state level fixed effects or a combined East German dummy). However, it is important to note that the impact of core labor market variables is still of the expected sign when further variables are added. In line with earlier empirical studies, the results show that macro regional differences matter, yet there are no qualitative effects on the estimated coefficients that hint toward a systematic rejection of the neoclassical migration model. Looking at the estimation results according to age groups, the impact of labor market signals is tested to be of greatest magnitude for workforce relevant age-groups and especially young cohorts between the ages of  18 to  25 and  25 to  30. We finally compute measures for the relative importance of labor market variables by age groups in a multiple regression framework including a broader set of controls. Our results show that for young cohorts up to  65% of the migratory movements can be explained by differences in regional income levels and unemployment rates. This latter result emphasizes the prominent role played by labor market conditions in determining internal migration rates of the workforce population in Germany.

Kurzfassung

Vor dem Hintergrund der arbeitsmarktpolitischen Relevanz von geografischen Mobilitätsraten der Erwerbsbevölkerung untersucht dieses Papier die empirische Validität des neoklassischen Migrationsmodells zur Erklärung interner Migrationsströme in Deutschland. Das neoklassische Migrationsmodell analysiert insbesondere die Arbeitsmarktdimension von Migrationsströmen und verbindet migrationsinduzierte Bevölkerungsveränderungen mit relativen Einkommens- und Beschäftigungssituationen in der Ursprungs- und Zielregion. Dabei wird angenommen, dass Regionen mit einem relativ höheren Einkommensniveau und niedriger Arbeitslosenquote tendenziell Migrationszuwächse verbuchen. Die induzierten Migrationsströme wirken in ihrer Reaktion auf regionale Arbeitsmarkungleichgewichte dann als Gleichgewichtsmechanismus, wobei eine gesteigerte In-Migration durch ein höheres Arbeitskräfteangebot den durchschnittlichen Lohnsatz in der Region reduziert und gleichzeitig die regionale Arbeitslosenquote anhebt. Aus wirtschaftspolitischer Perspektive sind die Implikationen des neoklassischen Migrationsmodells vor dem Hintergrund relevant, als dass so beispielsweise die Fähigkeit von Migrationsströmen zur Ausbalancierung asymmetrischer Schocks in regionalen Arbeitsmärkten untersucht werden kann. Obwohl das neoklassische Migrationsmodell im politischen Diskurs gängigerweise als Simulationsmodell und didaktisches Instrument eine breite Anwendung findet, ist die internationale empirische Evidenz hinsichtlich der zentralen Modellaussagen eher durchwachsen. Um mehr über die empirische Relevanz des neoklassischen Migrationsmodells für Deutschland zu erfahren, schätzt die Arbeit daher dynamische Migrationsgleichungen für  97 deutsche Raumordnungsregionen. Die Schätzungen basieren auf Daten aus dem Zeitraum von 1996 bis 2006 und beinhalten neben den zentralen Arbeitsmarktvariablen zur Messung regionaler Unterschiede in der Einkommenssituation und der Arbeitslosenquote weitere erklärende Faktoren wie den regionalen Besatz an Humankapital, die regionale Wettbewerbsfähigkeit, die Siedlungsstruktur und interregionale Pendlerverflechtungen als alternativen Angleichungsmechanismus für Arbeitsmarktungleichgewichte. Neben einer aggregierten Migrationsgleichung werden auch disaggregierte Modelle für verschiedene Altersgruppen geschätzt. Die Modelle werden als dynamische Paneldatenverfahren unter Verwendung von Instrumentenvariablen-Verfahren geschätzt, wobei ein besonderes Gewicht auf der Überprüfung der Schätzgüte der Modelle liegt. Neben Tests auf Endogenität der Regressoren hinsichtlich des Fehlerterms des Modells wird so auch für Abhängigkeiten über Querschnittseinheiten hinweg kontrolliert.

Sowohl die aggregierten als auch die altersgruppenspezifischen Resultate bestätigen den durch das neoklassische Migrationsmodell vorhergesagten Zusammenhang zwischen interner Migration und regionalen Arbeitsmarktungleichgewichten. Dabei gehen sowohl regionale Einkommensunterschiede als auch Disparitäten in den Arbeitslosenquoten als statistisch signifikant und dem Vorzeichen nach entsprechend der erwarteten Wirkungsrichtung des neoklassischen Modells in die Schätzgleichung ein. Die Resultate für die Arbeitsmarktvariablen bleiben auch dann stabil, wenn weitere Erklärungsgrößen wie der regionale Humankapital-Besatz, die regionale Wettbewerbsfähigkeit und Unterschiede in der Siedlungsstruktur in das empirische Modell aufgenommen werden. Die Hinzunahme von interregionalen Pendlerverflechtungen in die Migrationsgleichung zeigt eine negative Korrelation zwischen den beiden Variablen, was erste Rückschlüsse auf die substitutive Beziehung zwischen diesen Ausgleichsmechanismen für Arbeitsmarktungleichgewichte zulässt. Eine gesteigerte regionale Wettbewerbsfähigkeit hat den erwarteten positiven Einfluss auf Netto-In-Migrationsströme. Die empirischen Ergebnisse verdeutlichen auch die Bedeutung von regionalen Heterogenitäten im Migrationsverhalten, gemessen anhand von Unterschieden in der Siedlungsstruktur und den beiden West-Ost-Makroregionen (diese werden einerseits approximiert durch die Verwendung von fixen Effekten für einzelne Bundesländer sowie durch einen aggregierten Dummy für Ostdeutschland andererseits). Allerdings zeigen die Ergebnisse auch, dass der Effekt der zentralen Arbeitsmarktvariablen durch die Hinzunahme weiterer Variablen in das Modell nicht beeinflusst wird. In Analogie zu früheren empirischen Ergebnissen bestätigt die Untersuchung somit, dass makroregionale Unterschiede zwar von Bedeutung sind, diese jedoch nicht zu einer systematischen Ablehnung des neoklassischen Migrationsmodells führen. Die disaggregierten Ergebnisse für verschiedene Altersgruppen zeigen, dass insbesondere junge Alterskohorten zwischen  18 und  25 sowie  25 und  30 Jahren stark auf Unterschiede im regionalen Lohnniveau und der Arbeitslosenquote reagieren. Um diesen Zusammenhang näher zu untersuchen, werden abschließend statistische Maße zur Bestimmung der relativen Bedeutung der Arbeitsmarktvariablen in einem multiplen Regressionsmodell unter Verwendung weiterer Erklärungsgrößen berechnet. Die Resultate zeigen, dass für junge Kohorten bis zu  65% der Variation in den innerdeutschen Migrationsströmen durch die beiden Arbeitsmarktgrößen Einkommens- und Arbeitslosenquotenunterschiede erklärt werden können. Diese Ergebnisse unterstreichen insgesamt die prominente Rolle von regionalen Arbeitsmarktkonditionen bei der Bestimmung von Migrationsbewegungen der Erwerbsbevölkerung in Deutschland.

Notes

The opposite effect on  $ { \text{NM}_{ij} } $ holds for an increase in $ { \text{HK}\,\uparrow } $, $ { \text{INTCOMP}\,\uparrow } $ and $ { \text{POPDENS}\,\uparrow } $ in region  j.
See e.g. Maza and Villaverde (2004) for a similar definition of the dependent variable.
Of course, a full account of the simultaneity problem may call for a system approach that is also likely to increase the estimation efficiency if there are significant cross-correlations in the error terms for functional forms of the migration and labor market variable equations. However, a fully specified system approach goes beyond the scope of this paper.
The restricted (sub)set of moment conditions thereby only includes instruments from regressors in the vector  $ { X_{i,t} } $ (according to Eq.  (11)) that remain strongly exogenous in the sense that their factor loadings are mutually uncorrelated with the cross-section specific parameter of the common factor. Sarafidis et  al. (2009) propose to likewise test for the exogeneity of a subset of regressors by means of the standard Sargan/Hansen  test for overidentifying restrictions in a first step.
This result is also confirmed by Brücker and Trübswetter (2004). The latter study also focuses on the role of self-selection in East–West migration, finding that East–West migrants receive a higher individual wage compared to their non-migrating counterparts after controlling for the human capital level.
In this paper we account for regional and macro regional results by including East German and state level fixed effects. However, future work should also explicitly test for the poolability of the data for regional subgroups in a partial clustering framework.
We restrict our estimation approach to this period since regional boundaries of the German Spatial Planning Regions changed before and after, which may introduce a measurement problem that is likely to bias our empirical results.
We also checked for the sensitivity of the results when using composite indicators of human capital as discussed by Dreger et  al. (2009), accounting for human capital potential (measured in terms of high school graduates with university qualification per total population between 18–20 years) as well as science and technology related indicators (e.g., patent intensity). The results did not change.
It was only for the (rest of the country) aggregate of the unemployment rate that the Levin–Lin–Chu test could not reject the null of non-stationarity. However, the LLC-test rejects the null hypothesis of an integrated time series if the unemployment rate is transformed into regional differences $ { (\tilde{u}_{ij,t}) } $.
The latter study found that along with a second wave of East–West movements in early 2000 net flows out of East Germany were much higher than expected after controlling its labor market and macroeconomic performance. Since this trend was accompanied by a gradual fading out of economic distortions, this supports the view of ‘repressed’ migration flows for that period.
The authors argue that throughout the process of demographic change in Germany city core regions may gain in demographic terms from young migrants, while suburban and rural areas are expected to face increasing migration losses.
Detailed estimation results for the models are given in the Appendix.

References

Alecke, B., Untiedt, G.: Determinanten der Binnenwanderung in Deutschland seit der Wiedervereinigung. Working Paper. University of Münster (2000)
Alecke, B., Untiedt, G.: Pendlerpotential in den Grenzregionen an der EU-Außengrenze. Methoden, Ergebnisse und Prognosen. In: Riedel, J., Untiedt, G. (Eds.) EU-Osterweiterung und deutsche Grenzregionen. ifo dresden studien 28/II. Ifo Institut für Wirtschaftsforschung e.V., München (2001)
Google Scholar
Alecke, B., Mitze, T., Untiedt, G.: Internal migration, regional labour market dynamics and implications for German East–West-disparities – results from a panel VAR. Jahrb. Regionalwiss./Rev. Reg. Res. 30(2), 159–189 (2010)
Article Google Scholar
Arellano, M., Bond, S.: Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations. Rev. Econ. Stud. 58, 277–297 (1991)
Article Google Scholar
Baltagi, B.: Econometric Analysis of Panel Data. 4th edn. Wiley, Chichester (2008)
Google Scholar
Baltagi, B., Bresson, G., Pirotte, A.: Panel unit root tests and spatial dependence. J.  Appl. Econom. 22(2), 339–360 (2007)
Article Google Scholar
BBSR: Laufende Raumbeobachtungen des Bundesinstitutes für Bau-, Stadt- und Raumforschung. Various issues. Bundesamt für Bauwesen und Raumordnung, Bonn (2009)
Google Scholar
Blundell, R., Bond, S.: Initial conditions and moment restrictions in dynamic panel data models. J.  Econometrics 87, 115–143 (1998)
Article Google Scholar
Büchel, F., Schwarze, J.: Die Migration von Ost- nach Westdeutschland – Absicht und Realisierung. Mitt. Arbeitsmarkt. Berufsforsch. 27(1), 43–52 (1994)
Google Scholar
Brücker, H., Trübswetter, P.: Do the best go west? An analysis of the self-selection of employed East–West Migrants in Germany. IZA Discussion Papers 986. Forschungsinstitut zur Zukunft der Arbeit (IZA), Bonn (2004)
Google Scholar
Burda, M.C.: The determinants of East–West German migration. Some first results. Eur. Econ. Rev. 37, 452–461 (1993)
Article Google Scholar
Burda, M.C., Hunt, J.: From reunification to economic integration: productivity and the labour market in East Germany. Brookings Pap. Eco. Ac. 2, 1–92 (2001)
Article Google Scholar
Burda, M.C., Härdle, W., Müller, M., Werwartz, A.: Semiparametric analysis of German East–West migration intentions: facts and theory. J.  Appl. Econom. 13, 525–541 (1998)
Article Google Scholar
Daveri, F., Faini, R.: Where do migrants go? Oxford Econ. Pap. 51(4), 595–622 (1998)
Article Google Scholar
Decressin, J.: Internal migration in West Germany and implications for East–West salary convergence. Weltwirtsch. Arch. 130, 231–257 (1994)
Article Google Scholar
Destatis: Zu- und Fortzüge nach Altersgruppen (über die Kreisgrenze) – regionale Tiefe: Kreise und kreisfreie Städte. Wanderungsstatistik. Various issues. German Statistical Office, Wiesbaden (2009)
Google Scholar
Devillanova, C., Garcia-Fontes, W.: Migration across Spanish provinces: evidence from the social security records (1978–1992). Investig. Econ. 28(3), pp.461–487 (2004)
Google Scholar
Dreger, C., Erber, G., Glocker, D.: Regional measures of human capital in the European Union. IZA Discussion Paper No.  3919. Forschungsinstitut zur Zukunft der Arbeit (IZA), Bonn (2009)
Google Scholar
Eichenbaum, M., Hansen, L., Singelton, K.: A time series analysis of representative agent models of consumption and leisure under uncertainty. Q.  J. Econ. 103, 51–78 (1988)
Article Google Scholar
Etzo, I.: Determinants of interregional migration in Italy: a panel data analysis, MPRA Paper No.  5307. University Library of Munich, Munich (2007)
Google Scholar
Everaert, G., Pozzi, L.: Bootstrap-based bias correction for dynamic panels. J.  Econ. Dyn. Control 31(4), 1160–1184 (2007)
Article Google Scholar
Evers, G.: Simultaneous models for migration and commuting: macro and micro economic approaches. In: van Dijk, J. et  al. (Eds.) Migration and Labour Market Adjustment. Springer, Dordrecht (1989)
Google Scholar
Fachin, S.: Long-run trends in internal migrations in Italy: a study on panel cointegration with dependent units. J.  Appl. Econom. 51(4), 401–428 (2007)
Article Google Scholar
Federal Employment Agency: Arbeitsmarktstatistik der Bundesanstalt für Arbeit. Kreisdaten. Various issues. Available at: http://statistik.arbeitsagentur.de (2009). Accessed: July 2010
Ghatak, S., Mulhern, A., Watson, J.: Inter-regional migration in transition economies: the case of Poland. Rev. Dev. Econ. 12(1), 209–222 (2008)
Article Google Scholar
Greenwood, M.J., Hunt, G.L., Rickman, D.S., Treyz, G.I.: Migration, regional equilibrium, and the estimation of compensating differentials. Am. Econ. Rev. 81, 1382–1390 (1991)
Google Scholar
Grömping, U.: Relative importance for linear regression in  R: the package relaimpo. J.  Stat. Softw. 17(1), 1–27 (2006)
Article Google Scholar
Hansen, L.: Large sample properties of generalised method of moments estimators. Econometrica 50, 1029–1054 (1982)
Article Google Scholar
Harris, J.R., Todaro, M.P.: Migration, unemployment and development: a two sector analysis. Am. Econ. Rev. 60, 126–142 (1970)
Google Scholar
Hatzius, J.: Regional migration, unemployment and vacancies: evidence from West German microdata. Applied Economics Discussion Paper Series Nr.164, University Oxford (1994)
Hunt, J.: Staunching emigration from East Germany: age and the determinants of migration. J.  Eur. Econ. Assoc. 4(5), 1014–1037 (2006)
Article Google Scholar
Im, K., Pesaran, M., Shin, Y.: Testing for unit roots in heterogeneous panels. J.  Econometrics 115, 53–74 (2003)
Article Google Scholar
Jackman, R., Savouri, S.: Regional migration in Britain: an analysis of gross flows using NHS Central Register data. Econ.  J. 102(415), 1433–1450 (1992)
Article Google Scholar
Kiviet, J.: On bias, inconsistency, and efficiency of various estimators in dynamic panel data models. J.  Econometrics 68(1), 53–78 (1995)
Article Google Scholar
Levin, A., Lin, C., Chu, C.: Unit root tests in panel data: asymptotic and finite-sample properties. J.  Econometrics 115, 53–74 (2002)
Google Scholar
Maza, A., Villaverde, J.: Interregional migration in Spain: a semiparametric approach. Rev. Reg. Stud. 34(2), 156–171 (2004)
Google Scholar
Napolitano, O., Bonasia, M.: Determinants of different internal migration trends: the Italian experience. MPRA Paper No.  21734. University Library of Munich, Munich (2010)
Google Scholar
Nickell, S.: Biases in dynamic models with fixed effects. Econometrica 49, 1417–1426 (1981)
Article Google Scholar
Parikh, A., Van Leuvensteijn, M.: Interregional labour mobility, inequality and wage convergence. Appl. Econ. 35, 931–941 (2003)
Article Google Scholar
Pesaran, M.H.: Estimation and inference in large heterogeneous panels with a multifactor error structure. Econometrica 74(4), 967–1012 (2006)
Article Google Scholar
Pesaran, M.H.: A simple panel unit root test in the presence of cross-section dependence. J.  Appl. Econom. 22(2), 265–312 (2007)
Article Google Scholar
Pissarides, C., McMaster, I.: Regional migration, wages and unemployment: empirical evidence and implications for policy. Oxford Econ. Pap. 42, 812–831 (1990)
Google Scholar
Puhani, P.A.: Labour mobility – an adjustment mechanism in Euroland? Ger. Econ. Rev. 2(2), 127–140 (2001)
Article Google Scholar
Rainer, H., Siedler, T.: Social networks in determining migration and labour market outcomes: evidence from the German reunification. Econ. Transit. 17(4), 739–767 (2009)
Article Google Scholar
Roodman, D.: How to Do xtabond2: An introduction to ‘difference’ and ‘system’ GMM in Stata. Stata  J. 9(1), 86–136 (2009)
Google Scholar
Sarafidis, V., Wansbeek, T.: Cross-sectional dependence in panel data analysis. MPRA Paper No.  20367. University Library of Munich, Munich (2010)
Google Scholar
Sarafidis, V., Robertson, D.: On the impact of error cross-sectional dependence in short dynamic panel estimation. Economet.  J. 12(1), 62–81 (2009)
Article Google Scholar
Sarafidis, V., Yamagata, T., Robertson, D.: A test of cross section dependence for a linear dynamic panel model with regressors. J.  Econometrics 148(2), 149–161 (2009)
Article Google Scholar
Sargan, J.: The estimation of economic relationships using instrumental variables. Econometrica 26, 393–415 (1958)
Article Google Scholar
Schwarze, J.: Beeinflusst das Lohngefälle zwischen Ost- und Westdeutschland das Migrationsverhalten der Ostdeutschen? Allg. Stat. Arch. 80(1), 50–68 (1996)
Google Scholar
Schwarze, J., Wagner, G.: Abwanderung von Arbeitskräften und Einkommenspolitik. DIW Wochenbericht 59, 58–61 (1992)
Google Scholar
Sevestre, P., Trognon, A.: Dynamic linear models. In: Matyas, L., Sevestre, P. (Eds.) The Econometrics of Panel Data. A Handbook of the Theory with Applications. 2nd edn. Springer, Dordrecht et  al. (1995)
Google Scholar
Silva, S., Hadri, K., Tremayne, A.: Panel unit root tests in the presence of cross-sectional dependence: finite sample performance and an application. Economet.  J. 12(2), 340–366 (2009)
Article Google Scholar
Sjaastad, L.: The costs and returns of human migration. J.  Polit. Econ. 70, 80–93 (1962)
Article Google Scholar
Suedekum, J.: Selective migration, union wage setting and unemployment disparities in West Germany. Int. Econ.  J. 18(1), 33–48 (2004)
Article Google Scholar
Swiaczny, F., Graze, P., Schlömer, C.: Spatial impacts of demographic change in Germany. Urban population processes reconsidered. Z.  Bevölkerungswiss. 33, 181–205 (2008)
Article Google Scholar
Todaro, M.: A model for labor migration and urban unemployment in less developed countries. Am. Econ. Rev. 59(1), 138–148 (1969)
Google Scholar
VGRdL: Volkswirtschaftliche Gesamtrechnungen der Bundesländer (Regional Accounts for German States). Available at: https://vgrdl.de (2009). Accessed: July 2010
Wagner, G.: Arbeitslosigkeit, Abwanderung und Pendeln von Arbeitskräften der Neuen Bundesländer. Soz. Fortschr. 41(4), 84–89 (1992)
Google Scholar
Westerlund, O.: Employment opportunities, wages and international migration in Sweden 1970–1989. J.  Regional Sci. 37, 55–73 (1997)
Article Google Scholar

Download references

Acknowledgements

We thank Björn Alecke for helpful comments regarding the proper estimation of regional migration functions and help with the dataset.

Author information

Authors and Affiliations

RWI, Hohenzollernstr. 1–3, 45128, Essen, Germany
Timo Mitze
Ruhr University Bochum, Bochum, Germany
Timo Mitze
ifo Munich, Munich, Germany
Janina Reinkowski

Authors

Timo Mitze
View author publications
You can also search for this author in PubMed Google Scholar
Janina Reinkowski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Timo Mitze.

Appendix

1.1 Baseline and augmented regression results by age groups

Table 8 Baseline migration model based on system GMM estimation

Full size table

Table 9 Augmented migration model based on system GMM estimation

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mitze, T., Reinkowski, J. Testing the neoclassical migration model: overall and age-group specific results for German regions. ZAF 43, 277–297 (2011). https://doi.org/10.1007/s12651-010-0046-2

Download citation

Received: 04 July 2010
Accepted: 15 November 2010
Published: 24 December 2010
Issue Date: March 2011
DOI: https://doi.org/10.1007/s12651-010-0046-2

Testing the neoclassical migration model: overall and age-group specific results for German regions

Abstract

Zusammenfassung

1 Introduction

2 The neoclassical migration model

3 Econometric specification

3.1 Functional form of the empirical migration equation

3.2 Choice of estimation technique and model misspecification tests

4 What does the empirical literature say?

5 Data and stylized facts