Geographic Dependency of Population Distribution

Fujimoto, Shouji; Mizuno, Takayuki; Ohnishi, Takaaki; Shimizu, Chihiro; Watanabe, Tsutomu

doi:10.1007/978-3-319-20591-5_14

Geographic Dependency of Population Distribution

Conference paper
Open Access

13k Accesses
2 Citations
5 Altmetric

Part of the book series: Springer Proceedings in Complexity ((SPCOM))

Abstract

The agglomeration effect of population, which explains why many people live near places where many other people also live, is one important interaction that influences human population. We examine the agglomeration effect by measuring the distribution of the logarithmic differences between populations living in two places separated by some distance. The shapes of the distributions of the logarithmic differences closely resemble each other without depending on the regions or the country in cases of small scale of separation distance. This result suggests a unified explanation to understand the population distributions of various regions.

You have full access to this open access chapter, Download conference paper PDF

1 Introduction

Population distribution has been studied for many decades. Zipf’s law [1], which argues that the size distribution of a city’s population is a power-law, is known well [2–6]. However, a problem exists: how to define the area of cities when we observe population distributions. The tail of a power-law distribution is composed of megacities. By dividing megacities into several smaller cities, the distribution’s tail becomes thin. Because of the different definitions of a city, population distribution is not a power-law distribution but a log-normal one [7–9]. City areas have been decided by geographical, historical, and administrative factors. Rozenfeld et al. proposed a method that decided a city’s area by a city clustering algorithm [10]. In this research, we divide spatial regions by a method that ignores the shape of cities to find the properties of population distribution that do not depend on countries or local regions.

We investigated population distribution using a spatial division method by identically sized squares. This approach resembles a previous method [9]. In our case, we control the scale of the spatial division by changing the size of the squares and clarify the universal properties concerned with population agglomeration. Population’s universal properties can be observed by changing the scale of the spatial division.

We introduce logarithmic differences between the nearest neighbor two square blocks in terms of population. The regional dependence of these values in terms of the shape of the distributions vanishes for small size scales. The property of the distribution of logarithmic differences is concerned with the correlation coefficient of the population in two squares. This correlation is one index to measure population agglomeration.

In this research, we investigate Japanese population data. In Sect. 14.2, we introduce eight regions to investigate local properties inside Japan. In Sect. 14.3, we compare several distributions concerned with population among these eight regions. Next we compare Japan and Europe in Sect. 14.5 and show the universal properties concerned with population in both cases.

2 Basic Information of Japanese Population

The Statistics Bureau of the Japanese Ministry of Internal Affairs and Communications conducts a census every five years. Much census data can be obtained in a mesh data format from its websites [11], including population data from 2000, 2005, and 2010. The mesh data are raster data that are obtained by equally dividing latitudes and longitudes. A mesh size of about 500×500 m² provides the highest accuracy for population. Mesh codes are assigned to each bit of data, and we can specify the data’s position on the map from this code.

The Japanese Ministry of Land, Infrastructure, Transport and Tourism provides land use data on its website [12]. Such data are also provided in a mesh data format. A mesh of about 100 [m]×100 [m] provides the highest accuracy for land use. In these data, a land use code (see Table 14.1) is assigned to each mesh. An inhabitable place is defined as any place that is fit for humans to live in. Inhabitable areas can be estimated by subtracting such uninhabitable areas as forests and lakes from the land area. We estimated the inhabitable areas by totaling the areas whose land use codes are 1, 2, 7, 9, A, and G. Only about 33 % of Japan’s land area is inhabitable because it has many mountainous areas. This percentage is smaller than European countries. For example, the inhabitable area percentages of Germany, France, and the United Kingdom are 68 %, 71 %, and 88 %, respectively. We have to use inhabitable areas instead of land areas to more precisely evaluate population density.

Table 14.1 Land use code assignment

Full size table

To investigate locality and universality, we divided Japan into the following eight regions based on traditional ways of combining several prefectures (see Fig. 14.1): Hokkaido, Tohoku, Kanto, Chubu, Kansai, Chugoku, Shikoku, and Kyushu. Table 14.2 shows the basic information of the eight regions. Population densities depend on the regions for various reasons.

Table 14.2 Basic information of eight Japanese regions

Full size table

3 Population Distribution in Japan

How to divide space is critical when examining population’s size distribution. Dividing space by municipal level is standard for investigating the size distributions of cities. In this study we do not use such spatial division method. We adopted square blocks of the same size as a spatial division method and divided a particular region into identical sized square lattices. Then we aggregated the population inside the square blocks and observed its population distribution. We can control the spatial division’s scale using this method. We use parameter BS [km], which denotes the size of one side of the square blocks.

Figure 14.2 shows a complementary cumulative distribution function (CCDF)

$$\displaystyle{ \mathrm{Pr}\{X \geq x\} }$$

(14.1)

of Japan’s population in 2010. The distributions of the regions with high-density populations such as Kanto and Kansai are plotted on the right side compared to other regions. The distributions of the regions with low-density populations such as Hokkaido are plotted on the left side compared to other regions. These properties denote the distribution locality. The population distributions vary by region.

To find the distribution quantities that do not depend on the region, we focused on the population distribution’s shape. For a small scale (BS = 0. 5 [km]), the right tail of the distributions rapidly falls. As BS becomes larger, the right tail of the distributions becomes gentler. The slopes of the right tail seem close to each other for a small BS. The value of the logarithmic differences between populations whose values are close to each other seems to share similar quantities of population distribution slopes.^{Footnote 1}

We use S(x, y) to denote the population inside a square whose vertex coordinates are $(x,y),(x +\mathrm{ BS},y),(x +\mathrm{ BS},y +\mathrm{ BS})$, and (x, y + BS). The logarithmic difference between the populations of nearest neighbors in x-direction is represented by

$$\displaystyle{ \ln S(x +\mathrm{ BS},y) -\ln S(x,y), }$$

(14.2)

and the logarithmic difference in y-direction is represented by

$$\displaystyle{ \ln S(x,y +\mathrm{ BS}) -\ln S(x,y). }$$

(14.3)

The logarithmic difference is a value that is frequently used in such time-series analyses as stock prices [13]. In this paper we apply it to spatial directions. The effects of the differences are the same regardless whether the difference direction is positive or negative in terms of the spatial direction. Next we investigate the distributions of the absolute value of the logarithmic differences.

Figure 14.3 shows the CCDF of the absolute value of the logarithmic differences between the nearest neighbor populations in Japan in 2010. For small scale (BS = 0. 5 [km]), the distributions almost overlap. As BS becomes larger, the right tail of the distributions becomes gentler, and they no longer overlap.

Figure 14.4 shows the BS dependence of the moments values of the distributions of absolute value of logarithmic differences. Where n-th order moments is defined by mean of n-th powered of the stochastic variable. These values are one of the quantitative index of the overlapping of the distributions.

Figure 14.5 compares the observed distribution and the distributions represented by analytic functions. The red lines show an exponential distribution whose CCDF is defined by

$$\displaystyle{ \mathrm{Pr}\{X \geq x\} =\int _{ x}^{\infty }\frac{1} {\mu } \exp \left (-\frac{t} {\mu } \right )dt. }$$

(14.4)

Here parameter μ is the distribution’s mean. The estimated values from the data are μ = 1. 1022 for BS = 0. 5 and μ = 1. 5629 for BS = 10. The blue curves show truncated normal distribution, whose CCDF is defined by

$$\displaystyle{ \mathrm{Pr}\{X \geq x\} =\int _{ x}^{\infty }\sqrt{\frac{2} {\pi \sigma ^{2}}} \exp \left (-\frac{t^{2}} {2\sigma ^{2}}\right )dt. }$$

(14.5)

Here parameter $\sigma$ is the standard deviation from the x = 0 of the distribution. The estimated values from the data are $\sigma = 1.4853$ for BS = 0. 5 and $\sigma = 2.0944$ for BS = 10. The shape of the distributions seems to be intermediate between the exponential and the truncated normal distributions. The distributions resemble a truncated normal distribution in a small BS scale. As BS becomes larger, the distribution becomes an exponential distribution. Intermediate distribution between Eq. (14.4) and Eq. (14.5) is represented by

$$\displaystyle{ \mathrm{Pr}\{X \geq x\} =\int _{ x}^{\infty } \frac{\alpha } {\lambda \varGamma \left (\frac{1} {\alpha } \right )}\exp \left (-\frac{t^{\alpha }} {\lambda ^{\alpha }} \right )dt. }$$

(14.6)

Where α is a shape parameter and $\lambda$ is a scale parameter. If α = 1, Eq. (14.6) corresponds to Eq. (14.4). If α = 2, Eq. (14.6) corresponds to Eq. (14.5). The green curves in Fig. 14.5 show distributions of Eq. (14.6). We selected the parameters $\alpha = 1.6,\lambda = 0.9$ for BS = 0. 5 and $\alpha = 1.2,\lambda = 0.9$ for BS = 10.

The shape of the distributions of the logarithmic differences of two values is concerned with the correlation between those two values. The left side of Fig. 14.6 shows a scatter plot of $\ln S(x,y)$ versus $\ln S(x +\mathrm{ BS},y)$ or $\ln S(x,y +\mathrm{ BS})$. From this figure, we observe agglomeration effect that many people live near places where many other people also live. The correlation coefficient is able to interpret as an index of agglomeration effect. The right side figure’s data are transformed from the left side figure’s data by dilating both axis data $\sqrt{ 2}$ and rotating clockwise 45^∘. The horizontal axis of the right side figure is the logarithmic summation between the nearest neighbor populations. The vertical axis of the right side figure is the logarithmic difference between the nearest neighbor populations. The red bars are the standard deviation inside each segment, which is equally divided by the horizontal axis. The correlation of the left side figure represents the correlation between the population and the nearest neighbor population. If this correlation is strong, the population near the large population is large. It is considered that the strengthen of this correlation is one of the indices which represents degree of the agglomeration of population. The deviation of the distribution of the vertical axis of the right side figure concerns the correlation of the left side figure. The deviation of the distribution of the vertical axis of the right side figure shrinks when the correlation of the left side figure becomes strong. It is possible to estimate the degree of agglomeration of the population by observing the deviation of the distribution of the logarithmic difference.

4 Basic Information of European Populations

The European Union provides several kinds of statistical data from eurostat. The GEOSTAT project provides European countries’ population dataset representing in a 1 km² grid dataset. Population data for 2006 and 2011 are available on their website [14].

The food and agriculture organization of the United Nations statistics division (FOSTAT) [15] provides land and forest area data from most countries. We can roughly estimate the inhabitable areas by subtracting forest areas from land areas.

Table 14.3 shows the basic information of the top seven European countries by population. Their population density is lower than Japan. The variation of the population density of each country is smaller than the variation of all eight Japanese regions.

Table 14.3 Basic information of top seven European countries by population

Full size table

5 Comparison between Japan and European Countries

In this section we compare Japan and European countries in terms of the distribution of log differences of population. Figure 14.7 shows the CCDF of the absolute value of the logarithmic differences between the nearest neighbor population of Japan and EU countries. The results are almost the same as those among Japan’s eight regions. As BS becomes larger, the right tail of the distributions becomes gentler. The overlapping of the distributions for BS = 1 is better than for BS = 10. If we observed data whose scale BS = 0. 5, the overlapping would be better than for BS = 1.

The transitions of the distributions due to changes by BS are shown in Fig. 14.8. Japan’s distribution shape is almost the same as that of EU at a small BS. The difference of Japan and EU becomes larger as BS increases.

6 Conclusion

We investigated population distributions using Japan and EU data. Using a spatial division method with same size squares, we can easily control the division scale. The shape of the population distribution differs by country or region. We introduce logarithmic differences between nearest neighbor populations to identify distributions that do not depend on country or region. When the division scale is large, the distribution of logarithmic differences depends on the country or region. The local dependence of the distribution disappears as the division scale becomes smaller. The distribution’s shape closely resembles a normal distribution when the division scale is small; it is close to exponential distribution when the division scale is large.

This study investigated population distributions from a universal standpoint that does not depend on country or region. In general, various interactions determine population distribution. These interactions can be divided into two types. One is internal interactions, and the other is external interactions. External interactions are such environmental elements as topography and habitability. Internal interactions are interactions between people. Our results suggest that a universal feature exists for interaction with a small-scale neighboring population.

The next stage of our study will reproduce the results of Fig. 14.8 using a simple model. If we generate population data randomly, BS dependence of the shape of the distributions of logarithmic differences are quite different from Fig. 14.8. To reproduce the BS dependence of Fig. 14.8, we have to generate population configuration which satisfy the left figures of Fig. 14.6. We will have to introduce interactions between people to generate the agglomeration effect.

It would be interesting if the local features of population distribution could be explained by the interaction between people and environmental factors. We consider that the inhabitable area is most important in the environmental factor. We expect that the interaction between people and geometrical environmental factor is to be detected from relations between fluctuation of the population and the population density per inhabitable area.

Notes

1.

It is possible to confirm of this expectation by left figure of Fig. 14.6.

References

Zipf G (1949) Human behavior and the principle of least-effort. Addison-Wesley, Cambridge
Google Scholar
Hill BM (1970) J Am Stast Assoc 65(331):1220
Article MATH Google Scholar
Gabaix X (1999) Q J Econ 114:739
Article Google Scholar
Eeckhout BJ (2004) Am Econ Rev 94(5):1429
Article Google Scholar
Holmes TJ, Lee S (2009) In: Glaeser E (ed) In the economics of agglomerations, University of Chicago Press, pp 105–131
Google Scholar
Rozenfeld HD, Rybski D, Gabaix X, Makse HA (2011) Am Econ Rev 101(5):2205
Article Google Scholar
Portal site of official statistics of japan. URL http://www.e-stat.go.jp/SG1/estat/eStatTopPorta lE.do
National land numerical information download service. URL http://nlftp.mlit.go.jp/ksj-e/gml/datalist/KsjTmplt-L03-b.html
Mantegna RN, Stanley HE (2002) Nature 376:46
Article ADS Google Scholar
Eurostat geostat project. http://ec.europa.eu/eurostat/web/gisco/geostat-project/
FOSTAT. http://faostat3.fao.org/download/R/RL/E

Download references

Acknowledgements

The authors are grateful to SMSEC 2014, where this work was completed.

Author information

Authors and Affiliations

Faculty of Business Administration and Information Science, Kanazawa Gakuin University, Kanazawa, Ishikawa, Japan
Shouji Fujimoto
Department of Informatics, National Institute of Informatics, Chiyoda-ku, Tokyo, Japan
Takayuki Mizuno
Graduate University for Advanced Studies, Tokyo, Japan
Takayuki Mizuno
The Canon Institute for Global Studies, Marunouchi, Chiyoda-ku, Tokyo, Japan
Takayuki Mizuno
Graduate School of Information Science and Technology, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
Takaaki Ohnishi
The Canon Institute for Global Studies, Chiyoda-ku, Tokyo, Japan
Takaaki Ohnishi & Tsutomu Watanabe
Institute of Real Estate Studies, National University of Singapore, Singapore, Singapore
Chihiro Shimizu
Graduate School of Economics, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
Tsutomu Watanabe

Authors

Shouji Fujimoto

View author publications

You can also search for this author in PubMed Google Scholar
Takayuki Mizuno

View author publications

You can also search for this author in PubMed Google Scholar
Takaaki Ohnishi

View author publications

You can also search for this author in PubMed Google Scholar
Chihiro Shimizu

View author publications

You can also search for this author in PubMed Google Scholar
Tsutomu Watanabe

View author publications

You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shouji Fujimoto .

Editor information

Editors and Affiliations

Department of Computational Intelligence and Systems Science, Sony Computer Science Laboratories, Inc., Shinagawa, Tokyo, Japan
Hideki Takayasu
Department of Applied Physics, The University of Tokyo, Bunkyo, Tokyo, Japan
Nobuyasu Ito
Center for Service Research, National Institute of Advanced Industrial Science and Technology, Tsukuba, Ibaraki, Japan
Itsuki Noda
Dept Computational Intelligence, Tokyo Institute of Technology, Yokohama, Kanagawa, Japan
Misako Takayasu

Rights and permissions

Open Access This book is distributed under the terms of the Creative Commons Attribution Non-commercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fujimoto, S., Mizuno, T., Ohnishi, T., Shimizu, C., Watanabe, T. (2015). Geographic Dependency of Population Distribution. In: Takayasu, H., Ito, N., Noda, I., Takayasu, M. (eds) Proceedings of the International Conference on Social Modeling and Simulation, plus Econophysics Colloquium 2014. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-319-20591-5_14

Download citation

DOI: https://doi.org/10.1007/978-3-319-20591-5_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20590-8
Online ISBN: 978-3-319-20591-5
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)

Publish with us

Policies and ethics

Abstract

1 Introduction

2 Basic Information of Japanese Population

3 Population Distribution in Japan

4 Basic Information of European Populations

5 Comparison between Japan and European Countries

6 Conclusion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation