Antonio COLONNA1, Vittorio DI STEFANO1, Silvana LOMBARDO1,2, Lorenzo PAPINI3 and Giovanni A. RABINO3
1LAC - Laboratorio per le Applicazioni del
Calcolo, Università di Roma "La Sapienza", via
Gramsci 53, 00197 Rome, Italy.
2DPTU - Dipartimento di pianificazione Territoriale
e Urbanistica, Università di Roma "La Sapienza",
via Flaminia 70, 00196 Rome, Italy.
Email: victor@divinf.it
3DISET - Dipartimento di Ingegneria dei
Sistemi Edilizi e Territoriali, Politecnico di Milano, piazza
Leonardo da Vinci 32, 20133 Milan, Italy.
Email: Giovanni.Rabino@polimi.it
This paper presents the application to a real case study of the theoretical-methodological work presented at ACRI96 (Papini and Rabino, 1997), concerning the development of a new generation of urban models, which can be called "intelligent" simulators (in the sense of A.I.) of the urban systems evolution.
In chapter 2 the conceptual meaning of such theoretical proposal is reassumed, by identifying its roots in different veins of reasoning. In chapter 3, learning cellular automata are presented as operational proposal and the mathematical-computer details of the prototype carried out are specified.
The following chapters describe the application carried out to the Rome urban system: chapter 4 presents the way in which the urban system was represented; chapter 5 describes the "design" of the model, that is the learning mechanisms adopted in the case study; chapter 6 shows the results, which correspond to what the model "discovered" about the transformation factors working in the study area.
In a moment apparently propitious for a relaunching of both theoretical and operational urban mathematical modelling (see Klosterman, 1994), the development of such activity and the proposal of new models request the consideration of at least four aspects: the (new) epistemological climate; the evolution of urban modelling in recent years; the (changed) computer context (from both the point of views of theory and tools); the always crucial connection with urban planning.
In the development of new urban models, the new epistemological climate and the "science of complexity" cannot be neglected. A lot has been written and said about this "science" (in the field of urban planning, see for instance Rabino, 1995 and Lombardo, 1995), so that we will remind here neither its phenomenology (irreversibility, selforganization, etc.) nor its principles, that must be taken into consideration in model building.
It must be stressed, however, a direction of analysis often neglected: that one which considers not only the representation of a complex reality in a model, but also the "location" of the model itself in a more complex world. In other words, the idea is to reconsider firstly the relation model-model builder from the point of view of complexity.
In this sense, we can see that, starting from the model viewed as an "objective representation" of the analysed system, a new idea is recently gaining ground, which looks at a model as an "organised system of knowledge" about the studied system.
This change is discussed in Occelli and Rabino (1997); here we add that the new point of view allows in some way the model to have an autonomous life with respect to the model builder, therefore, it is not inappropriate to consider the model as a cognitive system (an intelligent system) observing a given reality.
In recent years, urban modelling showed two main characteristics:
The former aspect will be treated in chapter 2.3. As to the latter, we can say that Allen's model (Allen and Sanglier, 1979) is the paradigmatic representative of the modelling change which counts contributions by several authors: Bertuglia, Diappi, Lombardo, Rabino, Reggiani, in Italy and Haag, Pumain, Wilson, etc. (beside Allen) in Germany, France, England.
Though Allen's model produced in the eighties a strong innovative drive, now we can in retrospect recognise in this model two substantial weaknesses (that can be found, however, also in the work of the other authors quoted) with respect to the modelling change mentioned above:
Therefore, the challenge for the theoretical urban modelling is still the development of really evolutionary models.
In addition, we can add that:
We cannot discuss the development of urban modelling without taking into account the development of computers, which cancelled all the elements of the "requiem for large scale models" of the seventies (see Klosterman, 1994).
A new style of "management" of the models has become inevitable: the elaboration of data through standard applications (e.g. Excel), the friendly use of the model by means of Windows-like menus, the cartographic mapping of data (GIS), etc.
However, it is to be regretted that, beside the direct exploitation of the new possibilities, little has been made in order to probe the new open fields of the interaction between modelling and computers (e.g. interactive modelling, interface between models and GIS, etc.).
The same can be said as to the application of A.I. tools to urban analysis and urban planning: in these fields, the use of Expert Systems, Neural Network and the like comes often to straight applications of such techniques.
It is then necessary, beyond techniques, to work at an innovative use of such tools, which could catch their deep nature of "knowledge engineering methods" (e.g. learning methods, forms of knowledge management, etc.) and provide the tools for the development of territorial "cognitive systems" (that is, new models).
Apart from the opinions about the usefulness of mathematical models, their application is always associated with the analytical phase of the planning process, as a tool to build information and knowledge organisation (e.g., in the evaluation phase, in order to probe the effects of planning actions).
However, we can wonder whether modelling could be present also in the really creative moments of the planning process (e.g. problems individuation, the "invention" of plan alternatives). Now it seems possible (Rabino, 1995), in fact the emerging idea of a model conceived as an evolutive cognitive system (selforganizing) stresses that the new focus is on the "creative processes" (and therefore on the creative aspects of planning).
The development of a new modelling, then, seems to aim at a new and more relevant role in the context of urban and territorial planning and management.
All the elements introduced in the previous chapters aim toward one goal:
In the next chapter we try to traduce this ambitious goal in an operational project.
The development of a model with the characteristics said above requests to go over again the logical process of model building, in order to find the aspects of such process which can be modified in the wished direction. It can be found that in the connection between "real world", that is the world of the observed phenomenon (the phenomenon to be modelled), and "formal world", that is the world of the modelled phenomenon, there lies a cyclic mechanism of induction and deduction (of creative intelligence and of "building" intelligence), as follows:
The building of a really evolutionary model (in the sense explained above) means that the process described above must be made endogenous in the model.
Obviously, in a researcher, such process is strongly influenced by his/her specific conditions: his/her knowledge on the topic, his/her general culture, his/her keenness, etc.. In the same way, in the really evolutionary model, the problem is to define a specific context inside which "to invent" the model; such context, in this initial phase of our ambitious project, will be necessarily a very limited dominion of knowledge.
In other terms, it is reasonable to aim at building a "mechanism" which is certainly not able to invent a totally new model, but which is able at least to specify some elements of a model already outlined as to its general features. It means that we cannot aspire, for the moment, to reproduce all the formulation-resolution-verification cycle from its first iteration, but we can aspire to replicate such cycle starting from a first formulation of the model, exogenously given.
Therefore, the task is:
In equivalent terms, we need that:
As to the metamodel, we will resort to the knowledge in the field of the existing urban modelling, as to the learning algorithms, we will look at the field of A.I. (neural networks, classifiers systems, etc.).
In order to make operational the general model proposed in 3.1, cellular automata are specially suitable, for the following reasons:
At this point, it is useful to remind briefly the way of working of "classic" cellular automata:
In the learning cellular automata, this latter operation is replaced by an automatic procedure of learning of the rules mentioned above.
There are two aspects to be specified:
As far as the latter aspect is concerned, two more aspects must be specified:
b1. |
the level of semantic complexity at which
the rules to be learnt are expressed (e.g. as general urban planning
principles or more specific modelling rules); |
b2. | the "form" and the "criterion" by which the fitting between the learnt map and the target map is evaluated (in other terms, the "sensors" of the learning algorithm must be specified). |
On the side of the semantic complexity, in this initial steps of our work, we adopted the simplest solution, that is the direct expression of the rules in the same form as they act in the CA.
On the side of the sensors too, we choose the simplest solution, that is to accept or reject the rules depending on the matching (after their reiterated application) between the observed and the learnt state of a single cell of the automaton. More precisely, this matching (at single cell level) is analysed for a very great number of randomly selected cells and the rule is either prized (by increasing its level of "confirmation" and then its "strength") or punished (by decreasing its "strength") depending on the goodness of fit.
The structure of the prototype is described in details in Papini and Rabino, 1997. It can be said here that, on one side we have a classic cellular automaton, while the other side presents a learning system (a typical classifiers system), divided into its three components: a rules generator, a rules "evaluator" and a module for the management of the system itself. The rules and the sensors are respectively the input and output interfaces of the automaton with respect to the classifiers system.
Moreover, it is to be specified that the rules generator is a genetic algorithm which "invents" new possible rules (to be evaluated) starting from previous rules, through the usual evolutionary mechanisms (random mutation, crossover, etc.). Therefore, the evolutionary character (beside that of learning) can be ascribed to the proposed cellular automaton.
Finally, we recall that the rules introduced in the model are rules of the kind IF ...THEN ... (condition-action rule, typical in classifiers systems) and codified by means of a bits string. The string defines the initial state of the cell, the state of its neighbouring, the presence (or absence) of other attributes of the cell and the final state of the cell, codified in the same way as the initial state.
In the application of a learning cellular automata to an urban system, a particularly critical phase is represented by the writing out of the "textbook", that is of the contents and architecture of the information which are supplied for the automata.
Such architecture must consist of an efficient and effective description of both the spatial and functional structures of the urban system, as well as of its evolution, so that the model can read and interpret the evolution of such structures and deduce the rules which produced the transformation occurred in the considered time interval.
We describe the problems met in the description of such information structure and indicate the solution adopted in the application to Rome metropolitan area: these solutions are quite different from those that can be found in the classic applications of cellular automata urban systems.
In the models conceived and/or applied in the US (since Lowry model up to C. A.) urban space has been usually disaggregated by adopting orthogonal grids.
This choice originates from the morphologic structure of the urban texture of the American cities and from the corresponding shape of the statistical elementary zones.
This representation of space turns out very useful in many contexts of urban modelling, ranging from the computation of distances and routes (in general, everything connected with the use of graphs), to all computer graphics problems, in which each square or rectangular sub-area can correspond with one or more pixels.
In Italy, such urban textures can be found only in the parts of cities built after the middle of the 19th century. The remaining, which is the most part, presents a spatial organisation and shape of buildings which cannot be represented by means of an orthogonal grid unless operating spatial deformations and data approximations so strong to be unacceptable, especially when working at a small and detailed scale, inside the urban "continuum".
Also the structure of distances and accessibility, on a network resulting from the overlay of centuries, is more complex than a structure in which the orthogonal grid prevails, with its regularity of blocks and of their size.
Therefore, we abandoned the idea of adopting an orthogonal grid for the representation of space, though aware of the increase of the level of complexity of the task, from both a semantic and a syntactic point of view. We then undertook the task of interfacing the C.A. with an algorithm which allows to represent a system in which cells be of any shape and assembled in any way.
By means of an appropriate software, it is then necessary to define and individuate adjacencies between such cells of any shape. We tackled this task being aware that it would not be useful only for the present application, but it would produce other relevant advantages, that are:
The problem of the identification of the boundaries of the system and that of its "closure" (i.e. how to model the relations between the "inside" and the "outside" of the system) is not new in urban modelling and has been always posed in the following terms:
1. | boundaries: how to determine the boundaries of the system in such a way to ensure that the most part of the considered phenomena and processes take place inside the system and therefore interactions with the outside be weak enough. |
The problem, which is included in the more general problem of the individuation of sub-regions, was often tackled by means of clustering techniques. (applied to interaction flows in a determined time section.
However, in the case at study, the spatial and functional interdependencies (the "rules"): are "found out" by the model; are rendered explicit only at the end of the learning process; are dynamic, as they are derived by the observed evolution of the system.
To overcome these problems, we must go through these steps: 1. arbitrary choice of the boundaries of a subsystem; 2. application of the C.A. and identification of the transformation rules; 3. comparison between simulated and real evolution; 4. if the fitting is not satisfying, modify the boundary and go back to step n. 2.
2. | "closure" of the system: how to model the relations, however weak, which exist between the inside and the outside of the system. |
In the field of the application of classic CA, the problem concerns the cells located on the border, and the solutions adopted most frequently are: the system is closed like a torus; "mirrors" are placed on the borders (Serra, Zanarini, 1990), in some cases, with different "reflecting capacity".
In the applications to urban systems, these solutions are not satisfying, because:
In the application of our model, then, we choose to adopt a solution similar to that used for SIA models (see for instance Giangrande, Lombardo and Mortola, 1977).
It consisted in introducing "external zones": it was modelled their effect on the zones of the system, while it was not modelled the effect of the system on them.
This corresponds to the introduction of external "seeds" (Batty and Xie, 1994), which are "seen" by the C.A., but don't change during the dynamic evolution of the system and belong to the class of "fixed cells" (see the following chapter).
As in any representation of urban and territorial phenomena, the problem is to find the best compromise between the quality of the representation, which requests a satisfactory level of detail, and the possibility of "reading" the representation, which implies problems of size of the data base.
In our application the limits derive from the CA "machine" which reads the rules, where a rule is the specification of all the land use transformations (and the number of transformations is proportional to the square of the considered land uses).
Therefore, the represented land uses must have the following requisites: a) to be in a number "bearable" by the algorithm; b) to be obtained by aggregations of urban activities homogeneous as to location behaviour; c) such aggregations must not "hide" the changes occurred, that is they must allow to catch the (most part of) change.
On the basis of the above considerations , we analysed the 1981 and 1991 Rome Census data, referred to 5600 Census zones and to 674 types of activity.
The elaboration were made more complex by the awareness that, for a relevant part of the urban area, it was not acceptable to define only one prevailing land use (this problem was stressed also in Batty, Couclelis and Eichen, 1997), but it was necessary to introduce mixed land uses of the cells. We then had to define appropriate indicators and develop statistical analyses aimed to the individuation of thresholds which allowed to define, for each cell, the prevailing land use or the different mixes of land uses.
This analysis was developed following four consecutive levels:
1st level. Individuation of the cells with only residential use (R), with only productive use (P), with mixed use (M).
Indicator: IR = population / (pop.+total jobs)
Thresholds: IR > 0.9 Þ R; IR < 0.7 Þ P; 0.7 < IR <0.9
ÞM2nd level. Classification of the cells with only residential use (R) as medium-high density cells (Ra), medium-low density cells (Rb) and empty cells (V).
Indicator: IRD = inhabitants per hectare
Thresholds: IRD ³ 250 Þ Ra; 10 < IRD < 250 Þ Rb; IRD < 10 ÞV
3rd level. Classification of the cells with only productive use (P) for different economic activities (Pi); identification of further empty cells (V).
The economic activities were aggregated in such a way to satisfy the requisites described in 4.3.1:
1) TS: high tertiary 2) TI: common tertiary 3) IN: industry 4) SA: health services
Indicator: IPi = employed in activity i / total of employed
For each cell, it is chosen the activity with the higher number of employed (P1) and the next one (P2).
Thresholds: IP1 - IP2 > 0.1 Þ P1 is the prevailing activity, otherwise there is a mix of activities,
identified in the next level.
4th level. Classification of cells with mixed use (M) for different mixes (2 max).
The cells where there is a mix of residential and productive land use were identified at the 1st level and
the economic activity with which residential land use is mixed is defined by the maximum value of IPi
To identify the mix of activities, we assumed: IP1 - IP2 £ 0.1 Þ mix of P1 and P2.
As a consequence, the considered land use are 14 (see Table 1).
Table 1: The considered land uses
Prevailing | Symbol | Mix | Symbol |
Empty | V | Residence+high tertiary | M |
High density residence | R | Residence+low tertiary | M |
Low density residence | R | Residence+industry | M |
High tertiary | P | Residence+health services | M |
Common tertiary | P | High tertiary+common tertiary | P |
Industry | P | High tertiary+industry | P |
Health services | P | Common tertiary+industry | P |
Ec. activities+health services | P |
Figures 1and 2 show - as one example - the 1981 residential land use and the transformation occurred between 1981 and 1991.
Figure 1: Residential land use in Rome (1981)
Figure 2: Residential land use transformation (1981-1991)
It is to be considered that the general category here called "land use" includes land uses and/or characteristics (referred to points or to areas) which are fixed characteristics ("attributes") of the cells: they do not change, but influence the system dynamics.
Therefore, they are "seen" by the model as only "active" characteristics and not as "active and passive" ones.
The considered attributes can be distinguished as follow:
Other attributes do not make the cell a "fixed" cell; they are: 6) adjacency to a cell which presents one of the 5 attributes listed above; 7) historical centre (it influences attractivity and location costs).
These attributes are mapped in Figure 3; and the state of a (not fixed) cell is therefore defined by 12 bits: a) land use (4 bits); b) presence/absence of: station, access to highway, historical centre (1 bit each); c) adjacency to: station, access to highway, historical centre, park, outside "seed" (1 bit each)
Figure 3: Attributes of the cells
As said before, we think that it must be taken into consideration the fact that spatial influence reaches to some distance from the "source" and then does not involve only its adjacent cells. Therefore, unlike in the classic CA, in which spatial influence depends only on adjacency between cells, we introduced a continuous measure of the distance between cells, in such a way giving model the possibility of "studying" and introducing the idea of radius of influence of a cell and of its land use.
Indeed, in some applications of CA to urban systems there were introduced "bands" of "preferable" distance for the relations between different land uses. In these applications, though, it corresponded to the introduction of an additional rule, while, for our model, the distance matrix is an additional information whose role and importance is to be learnt.
However, as the use of a distance matrix with a size of 5600x5600 elements would produce difficulties in the computations and as it is likely that the spatial influences do not have relevant effects beyond a certain distance, we considered only distances not greater than 4 km. In other terms, it correspond to the adoption of an "enlarged adjacency", while the strict adjacency is used only as far as the influence of "attributes" is concerned.
In the experiments, we adopted geometrical distances between cells, in order to simplify the task of the preparation of the data base.
In synthesis, in our case, the problem is "to tune in" model sensors, designing its structure in such a way that it is able to read and interpret different typologies of phenomena and of evolutive structures. Table 2 lists some examples of phenomena and of evolutive structures of the system, with increasing complexity, on which sensors of our learning C.A.can be tuned in.
In this first simplified application to Rome urban
system, model was tuned in the recognition of both inertial and
dynamic behaviours and in the effects produced on each cell by
the status of adjacent cells (see the symbol "*"
in Table 2), while the effect of a continuous measure of distance
between cells was not introduced in the experiment described here.
Table 2: Examples of phenomena recognisable by the model
Dynamic behaviour | * Inertia
* Change |
Local effects | * Imitative
* Complementary Competitive Hierarchical . . . . . . . |
Field effects | Accessibility
. . . . . . . . . . |
Recognisable structures | Global level structures |
About 300 rules of land use transformation were identified, each of them can be evaluated through its "strength" by ordering them according to decreasing values of such parameter. If we take into account the first 15 rules, we can say that model learnt, beside the existence of a certain level of inertia of the system, that the most part of the transformations took place in peripheral areas and that the urbanisation phenomena took place in the cells with a good level of access to the public transport network and/or to the highways and in those with a good environmental quality (adjacency to parks).
The results of learning process can be synthesised in the following way:
| Cells land use (1981):
| In the neighbouring of:
| changed as:
| strength:
|
A)
| Empty
Low density residence
| # Railway stations
#*Highway accesses * Parks # Other municipalities-
| Low density residence
Resid.+Com. tertiary
| Very high
Med. high
|
B) | Low density residence | Sea | Industry | Med. high |
Low density residence | Tertiary mix | Med. high | ||
| Empty
Empty
|
| Residence+Industry
Residence+Tertiary
| Med. high
High
|
C)
| Empty
| Parks
Highway accesses
| Health services
| High
|
These rules concern mainly areas located in the periphery of the urban system; in particular:
The evolution rules identified were then applied to produce a simulation of the system. By comparing observed and simulated urban structure, the following level of error can be observed:
Figure 4: Errors in land use simulation (1991)
In conclusion, in our opinion, this first experiment with a complex and totally new tool such as a learning urban cellular automata produced very encouraging results, enough to contradict a recent statement made by Batty and Xie (1997):
As to the research prosecution, considering the simplifications adopted, not so much for the system representation (however, in next experiments, we will introduce real estate price, for instance) as for the sensors' structure of the CA, it will be oriented to the re-designing of the sensors, aimed to their sharpening, in order to allow the recognition of more complex phenomena, such as those indicated in Table 2.
Allen P.M., (1979) A dynamic model of growth in a central place system. Geographical Analysis, 11, 256-272.
Batty M., Couclelis H., Eichen M. (1997) Special issue: Urban systems as cellular automata. Environment and Planning B, 24, 159-164.
Batty M., Xie Y. (1994) From cells to cities. Environment and Planning B, 21, 531-548.
Batty M., Xie Y. (1997) Possible urban automata. Environment and Planning B, 24, 175-192.
Besussi E., Cecchini A. (1996) Artificial Worlds and Urban Studies, DAEST n.1, Venezia.
Giangrande A., Lombardo S., Mortola E. (1977) Calibratura del modello di Lowry estesa alla variabili di attrazione zonale: applicazione ad una sub-area dell'Alto Lazio. Atti delle Giornate di Lavoro AIRO. Parma, 731-751.
Klosterman R.E. (1994) Large-scale Urban Models: Retrospect and Prospect. Journal of American Planning Association, 60, 3-6.
Lombardo S. (1995) Note sulla valutazione dell'incertezza nei sistemi complessi. Atti del Seminario Internazionale "La città e le sue scienze", AISRe, Perugia, 86-102.
Occelli S., Rabino G.A. (1997) Un modello urbano operativo per la città post-fordista. Atti della XVIII Conferenza Italiana di Scienze Regionali, AISRe, Siracusa, 435-455.
Papini L., Rabino G.A. (1997) Urban Cellular Automata: an evolutionary prototype, in ( Bandini and Mauri eds.) ACRI '96. Proceedings of the Second Conference on Cellular Automata for Research and Industry. Springer , Berlin. 147-157.
Rabino G.A. (1995) Tesi di pianificazione urbanistica. Atti del Seminario Internazionale "La città e le sue scienze", AISRe, Perugia, 46-66.
Sanders L., Pumain D., Mathian H., Guerin-Pace F., Bura S. (1997) SIMPOP: a multi-agent system for the study of urbanism. Environment and Planning B, 24, 287-305.
Serra R, Zanarini, G. (1990) Complex Systems and Cognitive Processes. Springer Verlag, Berlino.
White R, Engelen G., Uljee (1993) Cellular automata modelling of fractal urban land use patterns: forecasting change for planning applications. paper presented at 8° European Colloquium on Theoretical and Quantitative Geography, Budapest.