On the Perturbation of Self-Organized Urban Street Networks

We investigate urban street networks as a whole within the frameworks of information physics and statistical physics. Urban street networks are envisaged as evolving social systems subject to a Boltzmann-mesoscopic entropy conservation. For self-organized urban street networks, our paradigm has already allowed us to recover the effectively observed scale-free distribution of roads and to foresee the distribution of junctions. The entropy conservation is interpreted as the conservation of the surprisal of the city-dwellers for their urban street network. In view to extend our investigations to other urban street networks, we consider to perturb our model for self-organized urban street networks by adding an external surprisal drift. We obtain the statistics for slightly drifted self-organized urban street networks. Besides being practical and manageable, this statistics separates the macroscopic evolution scale parameter from the mesoscopic social parameters. This opens the door to observational investigations on the universality of the evolution scale parameter. Ultimately, we argue that the strength of the external surprisal drift might be an indicator for the disengagement of the city-dwellers for their city.


Introduction
We seek to understand the statistics of urban street networks. Such an understanding will help urban designers and decision makers to improve urban policies in general and urban transportation in particular. In our work we investigate urban street networks as a whole within the frameworks of information physics (Knuth, 2011) and statistical physics (Jaynes, 1957a,b).
Although the number of times that a natural road crosses an other one has been widely observed to follow a discrete Pareto probability distribution (Clauset et al., 2009) among self-organized cities (Alexander, 1965;Crucitti et al., 2006;Jiang et al., 2014), very few efforts have focused on deriving the statistics of urban street networks from fundamental principles. Here a natural road (or road) denotes an accepted substitute for a "named" street (Jiang et al., 2014). In a recent work (Benoit and Jabari, 2019), we introduce a statistical physics model that derives the statistics of self-organized urban street networks by applying Jaynes's maximum entropy principle (Jaynes, 1957a,b) through the information physics paradigm (Knuth, 2011).
Our approach explicitly emphasizes the road-junction hierarchy of the initial urban street network rather than implicitly splitting it accordingly in two dual but distinct networks. Most of the investigations indeed seek to cast the initial urban 2004; Jiang and Liu, 2009;Jiang et al., 2008Jiang et al., , 2014Masucci et al., 2009;Porta et al., 2006b;Rosvall et al., 2005).
As an alternative, we may consider instead coloured city maps with lines of arbitrary widths. We have in mind street maps. Basically, a street map of a city has the particularity to exhibit how the city-dwellers perceive the urban street network of their own city. Explicitly, it shows how they have gathered along the time the street-segments of their own city to form streets. Implicitly, it reveals that we human townmen rather reason in terms of streets than of street-segments. But over all, deeply, it betrays a topological mindset that looks on street maps essentially for topological information. Indeed, to move from one place to another, we seek for directional information with the following three characteristic traits: (i) each pair of successive streets must critically share a common junction -whichever it is; (ii) each junction in itself plays a secondary role; (iii) neither position nor distance is important. The topological approach forces these three characteristic traits by reducing road maps to (topological) road-road networks. Here a natural road (or road, for short) is an accepted substitute for street (more precisely, for "named" street). A road-road network reduces roads to nodes and bonds each pair that shares a common junction. In contrast to street-junction networks, road-road networks are subject to complex network behaviours. In particular, they exhibit scale-freeness. We have a slight grasp of scale-freeness for an urban street network whenever we apprehend that only a few streets cross a large number of them, several streets cross an intermediate number of them, and very many streets cross a small number of them. Real data analysis shows that the valence distribution for the road-road network of a self-organized urban street network typically follows an inverse-power scaling law, namely, a scale-free power law (Crucitti et al., 2006;Jiang, 2007;Jiang and Claramunt, 2004;Jiang et al., 2008Jiang et al., , 2014Porta et al., 2006a,b). Thusly, this second attempt appears far more pertinent for at least two reasons. Firstly, the topological description unveils that urban street networks underlie complex behaviours generally observed in real complex networks (Jiang and Liu, 2009;Jiang et al., 2008Jiang et al., , 2014Masucci and Molinero, 2016;Masucci et al., 2009;Porta et al., 2006b;Rosvall et al., 2005). Secondly, it isolates a category of real urban street networks that shows evidence of a 'pure' scaling behaviour. This category may serve as a reference from which any general urban street networks deviates.

Road-Road Networks Mask Road-Junction Partial Orders
Even though the topological approach leads to precious observations, it remains mostly a descriptive tool. The topological approach does not provide any explanation, it is not concerned about the underlying principles for how urban street networks are emerging. A structural approach that does not bypass street-junctions (or road-junctions) allows us to establish a statistical physics foundation for the 'pure' scaling behaviour as effectively observed among self-organized urban street networks (Benoit and Jabari, 2019). It is fair to add that the structural approach may lead to alternative foundations, but also that it does not fail to catch the 'true structure' of urban street networks by forcing the three above topological characteristic traits a bit too early.
Here urban street networks are envisioned as a whole where road literally tie together through junctions. To begin with, we represent the ties by an incidence relation that gathers for each road all junctions through which it passes (Jiang et al., 2008). Then, we interpret this road-junction incidence relation as an object/attribute relation for which any road acts as an objects and any junction as an attribute (Atkin, 1974;Davey and Priestley, 2002;Ho, 1982). Eventually, by invoking the Formal Concept Analysis (FCA) paradigm, this change of perspective allows us to establish bijectively a partial-order relation (Davey and Priestley, 2002;Ho, 1982). In other words, every urban street network is subject to and bijectively representable by a partial-order.
More interestingly, any partial-order can be represented by an abstract ordered structure known as Galois lattice (Davey and Priestley, 2002;Ho, 1982). In general, a Galois lattice organizes itself in layers with respect to its partial-order, so that it can give rise to sympathetic graphical representations called Hasse diagrams (Davey and Priestley, 2002). It appears that urban street networks effortlessly reduce to intuitive two-layer Galois lattices: the roads and the junctions make up the lower nontrivial layer and the upper nontrivial layer, respectively; the 'imply' ordering relation (or join operator) is "passing through" (or "crossing at").
Distributivity is an important property of Galois lattices (Davey and Priestley, 2002). In particular, for any finite Galois lattice, distributivity allows us to claim that the elements of the first nontrivial layer are the join-irreducible elements of the Galois lattice; that is, each upper elements can be expressed as a join chain composed with elements of the first nontrivial layer -while none element of the first nontrivial layer can be decomposed. In our context, distributivity corresponds to the intuition that any junction is a crossing of only two roads. Therefore, any junction that joins more than two roads render the underlying Galois lattice nondistributive. However, as we have seen above, junctions mostly join two roads. Second, any junction that joins at least three roads can be replaced by a roundabout so that it remains only junctions that joins at most two roads. For theses two reasons, we may qualify as canonical any urban street network whose junctions effectively join only two roads.
Furthermore, it is noticeable that the underlying Galois lattice proves not only to reduce bijectively but also to reflect pertinently the involving topological complexity. Indeed, each underlying Galois lattice assigns a clear primary role to roads and a clear secondary role to junctions so that the three topological characteristic traits are valorized as they should be: roads imply junctions; roads are irreducible, while junctions are reducible to roads. By now, most of us should recognize road-road networks as either zeroth order approximations or projections of road-junction Galois lattices by employing either an analytic analogy or a geometric one, respectively.
To summarize, any urban street network bijectively reduces its topological complexity to an essentially distributive two-layer Galois lattice, while its canonicalization renders the latter plainly distributive.

Partial Orders Are Algebraic Structures
Actually, Galois lattices are not only ordered structures but also algebraic structures (Davey and Priestley, 2002;Ho, 1982). To put it another way, the join operator (or partial order) not only permit us to construct the entire Galois lattice from its joinirreducible elements (Davey and Priestley, 2002;Ho, 1982) but also to consistently assign numbers to its elements so that the algebra of these numbers reflects the algebra of the Galois lattice while their order respects its partial-order (Knuth, 2005(Knuth, , 2008(Knuth, , 2009(Knuth, , 2011(Knuth, , 2014. The evaluation of partially ordered sets (or Galois lattices) is the main object of the theory of information physics (Knuth, 2005(Knuth, , 2008(Knuth, , 2009(Knuth, , 2011(Knuth, , 2014. Because quantifying roads and their concomitant junctions enable us to confront models motivated by principles against observed data, the structural approach liberates us to restrain ourselves to perform sophisticated but nevertheless blind data analysis. Given this, the structural approach may no more appear as a gratuitous aesthetic step to the most skeptical readers. In brief, the structural approach is a game changer. In fact, three Galois lattices are getting involved (Knuth, 2008(Knuth, , 2009). Let us now, in order to forge for ourselves a better comprehensive picture, succinctly describe them and their respective valuation functions. The task is relatively easy since we are already familiar with the Galois lattice of our system, with the valuation function of the first extra Galois lattice, and almost with the valuation function of the last Galois lattice. Of course, our first and foremost Galois lattice is the system itself, so that our unique unknown valuation function Va is simply meant to describe the physics of our system. Specifically, the unknown valuation function Va assigns a positive real number indiscriminately to all roads and junctions so that each assigned positive real number characterises the physical state of the involved road or junction. Notice that valuation functions must be positive for consistency reasons. As generic Galois lattice components, roads and junctions organizes themselves in downsets. A downset is a set of elements which contains all the elements implying each of them (Davey and Priestley, 2002). If we mentally sketch our urban street network randomly by roads and junctions under the unique rule that a junction can be dotted only when all of its joining roads are already lined, then each downset represents a state of our mental picture -and vice versa. The set of all downsets ordered according to set inclusion ⊆ forms a distributive Galois lattice, which is called the state space. The state space is an auxiliary Galois lattice which merely helps us to introduce the next relevant one.
The join-irreducibles of the state space are the downsets associated to every road or junction, that is, the singleton sets composed of one road and sets composed of one junction along all of its joining roads. These join-irreducibles generate the state space with set union ∪ as join-operator. Nevertheless, in reality, given citydwellers may not know precisely which state their mental picture of the urban street network represents. Even so, they may have some information that exclude some states, but not others. Therefrom, the mental pictures of city-dwellers are mostly sets of potential states than single states know with certainty. A set of potential states is called a statement. The set of all possible statements is simply the powerset generated from the set of all states. Once ordered according to set inclusion ⊆, the set of all statements becomes a distributive Galois lattice whose the join-irreducibles are the states. This Galois lattice is known as the hypothesis space. Within the hypothesis space, statements follows a logical deduction order as each statement literally implies (or is included in) a statement with certainty. The valuation function associated to any hypothesis space is recognized to be a probability distribution. So, we are already very familiar with the algebra satisfied by the valuation functions associated to hypothesis spaces. Among valuation functions associated to Galois lattices, this algebra can be shown to be the only one possible by imposing natural algebraic consistency restrictions.
Let us digress briefly to bring our attention back to the system valuation function Va: as immediate consequence, for canonical urban street networks, the evaluation Va(j(r, s)) of a junction j(r, s) joining a pair of roads (r, s) must be the sum of the evaluations Va(r) and Va(s) of the joined roads r and s, respectively; we have Va(j(r, s)) = Va(r) + Va(s). (1) End of digression.
Further consistency requirements oblige us to identify the system probability distribution Pr as the composition of a weight function w with the system valuation function Va; we read (2) The weight function w constitutes our second unknown function. The construction of the hypothesis space from the state space corresponds technically to an exponentiation (Davey and Priestley, 2002). The exponentiation of the hypothesis space brings up an inquiry space. The hypothesis space is a distributive Galois lattice whose elements are questions. The quantification of the hypothesis space leads to a measure, coined relevance, which is a generalized measure of information with Shannon entropy as basis (Aczél et al., 1974). In fact, the inquiry space is Carrollian in the sense that it contains both vain (and fanciful) and unvain questions. The formers mainly occupy the lower part of the inquiry space, the latters mainly the upper part. The relevance of the smallest unvain question, which can be shown to be the most relevant question, identifies with the entropy of the system. To be brief, for canonical urban street networks, the functional entropy H[Va, w] takes the form where the first summation runs over the roads r and the second one over the junctions j(r, s) joining the pair of roads (r, s), while h : x → −x ln x is the Shannon entropy function. We will keep to express information measures in nat units. For further details on the theory of information physics, we simply refer to the work of Knuth (2005Knuth ( , 2008Knuth ( , 2009Knuth ( , 2011Knuth ( , 2014. For now, we have enough material to step forward.

From Galoisean Hierarchy to Paretian Coherence
Network data analysis shows that city-dwellers have a topological perception of their urban street networks. On the other hand, the topology of urban street networks hides a simple road-junction partial order that bijectively reduces to intuitive twolayer Galois lattices. The Galoisean hierarchy is intuitive in the sense that its joinoperator expresses our intuition that two roads join to form a junction. Nonetheless this intuitive hierarchy leads to two layers whose cardinality might be perceived as incommensurable. Typical big cities count far more than several roads and junctions. The apparent simplicity of the underlying Galois lattices is the result of an algorithmic thought. Nonetheless the Galoisean hierarchy is three-fold. While the ordering and algebraic perspectives are respectively structural and operational, the whole is measurable. The underlying algebraic structure leads unambiguously to a unique quantification modulo two unknown functions that we are free to choose. These two unknown functions are of different nature. The valuation function Va assigns to each road or junction of the urban street network a numerical quantity that characterizes its physical state. The weight function w, or more precisely its composition with the valuation function Va as expressed in (2), allows us to assign to each mental picture of the urban street network a numerical quantity that characterizes its perception among the city-dwellers. This assignment is simply the probability distribution Pr of our system. Ultimately all these mental pictures are surrounded by all sort of questions whose pertinence can be measured. The relevance of the most pertinent question is better known as the entropy of the system. The most plausible probability Pr, that is, the quantification which tends to represent at best the perception of the city-dwellers for their own urban street network, must also be the most relevant one. In other words, the most plausible probability Pr must maximize the functional entropy (3) of their urban street network. This is nothing other than Jaynes's maximum entropy principle (Jaynes, 1957a(Jaynes, ,b, 1989Kapur and Kesavan, 1992;Kesavan, 2009;Knuth, 2008). Thusly, our physical content shifts from a logarithmic order to a fluctuating organization. Roads and junctions indiscriminately yield our initial ignorance Jaynes (1989). The most we can tell is that roads and junctions are mesoscopic systems with a finite number of possible configurations Ω. Besides, we must assume our complete ignorance about their respective inner worlds. This means that, to our eyes at least, all their possible configurations are equally likely. Thusly, roads and junctions are Boltzmannian mesoscopic systems. Therefrom, the probability distribution Pr reduces to a function that depends only on the number of possible configurations Ω. Meanwhile, the functional entropy (3) simplifies to take the more sympathetic form On the other hand, here, self-organized urban street networks are idealized as scale-free systems, viz., as systems exhibiting no typical number of configurations but rather a typical scale λ. Thus, as suitable characterizing moments to invoke Jaynes's maximum entropy principle (Jaynes, 1957a,b;Kesavan, 2009), we must discard any classical moment and may consider logarithmic moments instead. It appears that imposing the first logarithmic moment Pr(Ω) ln Ω as sole characterizing constraint gives rise to the scale-free probability distribution A practical normalization of this probability distribution leads to the discrete Pareto probability distribution (Clauset et al., 2009). To sum up: the passage from the underlying Galoisean hierarchy to an underlying Paretian coherence occurs by invoking Jaynes's maximum entropy principle with the first logarithmic moment as sole characterizing moment and with complete ignorance as initial knowledge condition. For every road or junction having Ω possible configurations, the Boltzmann entropy ln Ω measures nothing but our complete ignorance on the configuration effectively taking place. So, our characterizing restriction simply claims that an idealized self-organized urban street network evolves by preserving our complete ignorance on average. This characterizing scheme that induces a Paretian coherence has been interpreted as some evolutionary based mechanism to maintain some opaque internal order (Dover, 2004;Milaković, 2001). Note furthermore that "complete ignorance" has rather remained, so far, a technical term. A more intuitive interpretation might be considered instead. If the Boltzmann entropy ln Ω is interpreted as the surprisal that city-dwellers associate to every road or junction having Ω possible configurations, then Pr(Ω) ln Ω becomes the amount of surprisal on average that they associate to their own urban street network. Surprisal (or surprise) Su = − ln • Pr was introduced by Tribus (1961) as a measure to quantify our astonishment and our indecision whenever we face any arbitrary event. Once adapted to our context, surprisal somehow betrays the perception of the city-dwellers for their own urban street network. Therefrom, the above Paretian characterizing constraint simply asserts that an idealized self-organized urban street network evolves by preserving on average the perception that its city-dwellers share for it. This assertion renders citydwellers the unconscious but nevertheless active actors of their own urban street networks, not the passive subjects of an obscure technical machinery. Along this line, the scale parameter λ of the underlying scale-free probability distribution (6) interprets itself as an evolution scale.

Untangling the Underlying Coherence
The underlying coherence, Paretian or not, does not reveal to city-dwellers as-is. Technically, we must still untangle the corresponding weight function w and valuation function Va with respect to the underlying algebraic structure, namely, with respect to composition (2) and addition rule (1). Practically, we need a mesoscopic model to count the number of configurations Ω associated to every road or junction. For the reason that roads and junctions are likely driven by social interactions, the mesoscopic model must typify social interactions. To fulfill this purpose, it appears convenient to adopt and adapt the network of intraconnected agents model introduced by Dover (2004) for the distribution of cities in countries. Thereby, each road or junction becomes a hive of agents that connect to each other. As agents, we may consider the inhabitants that somehow participate to the live activity of roads: drivers, cyclists, pedestrians, suppliers, institutional agents, residents, and so forth. For each road r, the number of agents is assumed to be asymptotically proportional to the number of junctions n r that r crosses -the ratio A being constant and sufficiently large. This expresses nothing but the extensive property of roads. Here the very existence of every road relies on the ability for each of its agent to maintain a crucial number of intraconnections which is crudely equal to a constant number υ r , called the number of vital connections for roads. The layout of theses intraconnections is implicitly associated to the internal order within each road, while the total number of possible layouts for each road is simplistically considered as its number of configurations (Dover, 2004). Therefrom, for each road r, the number of configurations Ω r yields As concerns each junction, continuing along this spirit, the involved agents are merely the agents of the two joining natural roads combined together. Nevertheless, as there is no apparent reason for roads and junctions to experience the same type of internal equilibrium, we will assume two distinct numbers of vital connections, υ r and υ j respectively. Then the same crude maneuvers give Therefore, the valuation function Va appears clearly to assign to each road or junction the number of its agents, and the weight function w to asymptotically count the number of possible vital intraconnection layouts -modulo normalization.

Ideal Self-Organized Urban Street Networks Coherence Based on Boltzmannian Mesoscopic Surprisals
It is time now to explicitly invoke Jaynes's maximum entropy principle for the functional entropy (4) with the first logarithmic moment (5) The constraint relative to the Lagrange multiplier λ compels to keep constant the first logarithmic moment (5) of the probability distribution Pr; namely, it imposes the preservation on average of the amount of surprisal that city-dwellers perceive for their roads and junctions. Meanwhile, the Lagrange multiplier ν ensures the normalization condition that the probability distribution Pr must satisfy. The constant S stands for the constant mean surprisal at which the system evolves -for now it plays a dummy role. Extremizing expression (8) yields ∂L ({Pr (Ω)} ; ν, λ) ∂ Pr(Ω) = − ln (Pr(Ω)) − ν − λ ln Ω = 0, which immediately leads to the scale-free probability distribution as previously claimed. Afterwards, the normalization condition effortlessly gives us an expression for the dependent exponential denominator exp(ν), which may be defined as the partition function Z(λ) of our system; we have Ultimately, we write solution (10) in the more familiar form The found most plausible probability distribution (12) concerns the underlying coherence of our system. As such, this coherence can only be perceived indirectly by the city-dwellers of the urban street network. The city-dwellers may rather perceive the coherence behind roads and junctions. Their corresponding statistics are obtained as follows. Substituting (7a) into (12), we readily obtain for roads which is a scale-free probability distribution. Injecting instead (7b) into (12), then gathering and counting with respect to the precedent probability distribution (13a) gives for junctions which is a generalized power law probability distribution; the summation in parentheses is simply the self-convolution of the road probability distribution (13a). The bracket around the equality statement follows Iverson's convention (Graham et al., 1994;Knuth, 1992): the bracket has value one whenever the bracketed statement is true, zero otherwise. The number of junction n r that a road crosses is essentially the number of roads with which it shares a common junction, namely, its valence number in the corresponding road-road network. So the probability distribution (13a) predicts the valence distribution for roads that has been widely observed empirically among self-organized cities (Crucitti et al., 2006;Jiang, 2007;Jiang and Claramunt, 2004;Jiang et al., 2008Jiang et al., , 2014Porta et al., 2006a,b). A similar argument dually applies for junctions. However, to the best of our knowledge, the valence distribution for junctions has brought no attention until now -except in our recent investigations.
For practical data analysis (Clauset et al., 2009), we need to assume that the number of junctions per road n r spans from some minimal positive value n r . Then the normalization of probability distributions (13) can be performed elegantly by using natural generalization of known special functions. First, the probability for a road to cross n r junctions becomes where ζ (α; a) = ∞ n=0 (a + n) −α is the generalized (or Hurwitz-) zeta function (Olver et al., 2010, § 25.11). Second, the probability for a junction to see n j junctions through its joining roads reads where W (α, β, γ; n) = m,n n m −α n −β (m + n) −γ is the two-dimensional generalized (or Hurwitz-) Mordell-Tornheim-Witten zeta function (Borwein and Dilcher, 2018). As a conclusion, let us remark that statistics (14) for an ideal self-organized urban street network does not separate the macroscopic parameter λ from the mesoscopic ones υ r and υ j in the sense that, at best, we can only estimate the products λυ r and λυ j . This separation of parameters is critical since it would allow us to distinguish quantitatively the macroscopic phenomenon of evolution from the mesoscopic phenomena of social interactions that take place in urban street networks. Notice that, from a qualitative perspective, two distinct behaviours are anticipated. The numbers of vital connections υ r and υ j certainly differ from one cultural basin to another one (Dover, 2004). Whereas the evolution scale λ might transcend cultures (West, 2017). A classical way to separate parameters in Physics consists to introduce sufficiently small perturbations. This is, in its observational form, the subject of the next subsection. Figure 1 exhibits the urban street network of Central London as a case study. The probability distribution for roads Pr (n r ) (14a) is highly plausible, as expected for any recognized self-organized city (Alexander, 1965;Jiang et al., 2014). However, for the time being, the validation of the probability distribution for junctions Pr (n j ) (14b) appears more delicate. This is due to the emergence of a numerical bottleneck as follows. The state-of-the-art statistical method to either validate or reject a plausible hypothesis for power law probability distributions is based on Maximum Likelihood Estimations (MLE) (Clauset et al., 2009). Besides invoking a numerical minimizer (Press et al., 2007), this method requires sampling (Clauset et al., 2009), that is, the input sample must be compared to a large set of randomly generated samples -the larger, the more precise. In the present case, this means that the numerical evaluation of the normalizing functions ζ and W -and of their respective logarithms and logarithmic derivatives -have to be efficient not only in terms of precision but also in terms of speed. Efficient numerical methods to evaluate the Hurwitz-zeta function ζ can be found in the classical numerical literature (Oldham et al., 2009;Olver et al., 2010) -while they can easily be adapted  (1)). The green fitted curve for the junction statistics shows the best Nonlinear Least-Squares Fitting (NLSF) for the nonstandard discrete probability distribution (14b) with n r and 2λυr fixed to their respective MLE value (2λυ j ≈ −1.3); since fast evaluation of the normalizing function W has yet to be found, no MLE approach can be used for now. Having a number of vital connections υ j negative means that the associated generalized binomial combination number is smaller than one, i.e., that the number of agent intraconnections for junctions is relatively much smaller than the one for natural roads.

Case Study of Central London
to our specific usage. By contrast, the two-dimensional Mordell-Tornheim-Witten zeta function W belongs to the specialized numerical literature and its numerical computation is still a subject of investigation (Borwein and Dilcher, 2018). In practice, even the implementation of the corresponding Hurwitz generalization with the same two first exponents α and β is rather tedious while very slow, especially when the third exponent γ becomes negative -as 2λυ j appeared to be. To work around this numerical bottleneck, we performed a crude data analysis based on a Nonlinear Least-Squares Fitting (NLSF). Interestingly, our ad hoc crude data analysis reveals a negative number of vital connection υ j , which means that the number of intraconnections for junctions might be relatively much smaller than the one for roads in self-organized cities.

Drifted Self-Organized Urban Street Networks
Coherence Based on Drifted Boltzmannian Mesoscopic Surprisals Now let us regard the self-organized urban street networks studied in the previous section as an ideal class of urban street networks, namely, as a reference from which 'real' urban street networks deviate. The deviation is vanishing for self-organized urban street networks. For arbitrary urban street networks, the deviation might be of arbitrary magnitude. Furthermore, we presume that deviations are essentially caused by artificial means, but not due to any change in the behaviours of the citydwellers. Artificial deviations are created by urban designers or decision makers who remodel cities for arbitrary purposes but without respect to the laws that might govern the spontaneous evolution of cities. Meanwhile, the topological mindset of city-dwellers and the social machinery that governs roads and junctions remain unchanged. Moreover, a priori, there are no apparent reason that the remodelling affects one iota the deep paradigm which constructs the perception of city-dwellers: roads and junctions remain perceived as Boltzmannian mesoscopic systems. Nevertheless, the remodeled urban street networks might no more reflect their perception -not vice versa. In other words, the deviations drift the surprisal of city-dwellers for their own urban street network. Assuming a surprisal drift Λ(Ω) that generates an extra amount of surprisal ∆ S on average, the unique characterizing constraint bracket in Shannon Lagrangian (8) becomes Carefully expanding (15) gives rise to two apparent characterizing restrictions: the first logarithmic moment characterizing constraint discussed above and a new characterizing constraint, respectively The introduced Lagrange multiplier ε tells us how urban designers or decision makers impose a surprisal drift Λ(Ω) to the surprisal perception of city-dwellers for their own urban street network. The constant ∆ S corresponds to the part of the apparent mean surprisal caused by the surprisal drift Λ(Ω) itself -for now, as the constant S , it plays a dummy role. Extremizing expression (17) holds from which we readily find the power law probability distribution With the same easy manipulation as before, the normalization condition allows us to define the deviant partition function Z(Λ; λ, ε) of our drifted system; we get So we end up by writing the most plausible probability distribution associated to Shannon Lagrangian (17) as For non-vanishing surprisal drift ε Λ(Ω), as expected, this probability distribution is obviously not scale-free. In fact, when the polynomial part of the asymptotic expansion of Λ(Ω) does not reduce to a constant, the surprisal drift ε Λ(Ω) acts as a cut-off function. In other words, in contrast to ideal self-organized urban street networks, a typical deviant urban street network possesses a typical number of configurations for its roads and junctions. Our next task is to establish the statistics for roads and junctions in deviant urban street networks. Substitution of (7a) into (12) yields once the surprisal drift Λ is suitably rescaled to the surprisal drift for roads Λ r . Afterwards, substitution of (7b) into (12) along Iversonian counting with respect to (22a) gives r,s) [n j = n r + n s ] (n r n s ) 2λυr exp −ε Λ r (n 2υr r ) + Λ r (n 2υr s ) with the same notation convention previously used.
The main interest of the deviant statistics (22) lies in showing how surprisal drift formally separates the evolution scale exponent λ from the numbers of vital connections for roads and junctions, υ r and υ j respectively. As seen in the previous subsection, this separation of parameters is important as it means that the macroscopic phenomenon of evolution and the mesoscopic phenomena of social interactions can be qualitatively studied among drifted self-organized urban street networks. Fortunately enough, such qualitative investigations among slightly drifted self-organized urban street networks appears almost as manageable as the ideal case investigation among self-organized urban street networks as follows.

Exploratory Case Study of Slightly Drifted Urban Street Networks
Let us first specify what we mean when a self-organized urban street network is slightly drifted. Here it is important to bear in mind that the numbers of configurations (7) result from asymptotic countings. So, the surprisal drifts for roads and junctions introduced in (22), Λ r and Λ j respectively, reach the asymptotic behaviour of the underlying surprisal drift Λ. Let us now assume that the underlying surprisal drift Λ(Ω) admits as asymptotic expansion a generic finite Laurent polynomial of the form a −p Ω −p + a −p+1 Ω −p+1 + · · · + a 0 + · · · + a q−1 Ω q−1 + a q Ω q . The non-polynomial part is absorbed by the exponential function whose the surprisal drifts feed, hence irrelevant. The zeroth order coefficient a 0 is eliminated during the normalization by factorizing its inverse exponentiation, hence meaningless. The remaining polynomial part a 1 Ω + · · · + a q−1 Ω q−1 + a q Ω q is imposed, by the normalization condition, to be positive for large Ω values. More importantly, the remaining polynomial behaves as an asymptotic cut-off polynomial whose strength lies in its leading term a q Ω q . We may consider surprisal drifts asymptotic to quadratic or of higher degree polynomials as inducing too drastic cut-offs, namely, as altering too drastically self-organized urban street networks. That is, for the time being, we consider as slight any surprisal drift that is asymptotic to a monomial of degree one a 1 Ω whose coefficient a 1 is arbitrary small -and positive. Thus, we may assume, without loss of generality, that the slight surprisal drift Λ(Ω) reduces to the canonical monomial Ω so that ε Λ(Ω) = ε Ω. Therefrom the parameter ε simply expresses the strength of our slight surprisal drift. Once properly rescaled, the parameter ε becomes the strengths ε r and ε j associated to the slight surprisal drifts for roads and junctions, respectively; we write ε Λ r (n 2υr r ) = ε r n 2υr Now we may carry out the statistics for roads and junctions in slightly deviant urban street networks. Substituting (23) into (22), then making the change of parameters for conciseness, we obtain Pr (n r ) ∝ n −2λυr For practical normalization, the change of parameters (24) appears precious to easily identify the involved special functions. First, the probability (14a) for a road to cross n r junctions in an idealized self-organized urban street network takes in a slightly deviant urban street network the form Pr (n r ) = n −2λυr r z r n 2υr r Φ (z r , 2λυ r , n r ; 2υ r ) , where Φ (z, α, a; β) = ∞ n=0 (a + n) −α z (a+n) β is the generalization introduced by Johnson (1974) of the Lerch transcendent function Φ (z, α, a) = ∞ n=0 (a + n) −α z n (Olver et al., 2010, § 25.14). Second, the concomitant probability (14b) for a junction to see n j junctions through its joining roads then transforms into where is introduced for the sake of completeness. The validation of the slightly deviant statistics (26) is more challenging than of the ideal statistics (14) from which it deviates for at least two reasons. Firstly, it is rather an exploratory work since we have no catalogue of slightly deviant urban street networks from which we can pick up relevant samples. Secondly, the involved normalizing functions Φ and W Φ are both computationally challenging. Nonetheless, the numerical bottleneck is again a bearer of both good news and bad news.
The bad news, without surprise, is that further investigations on the deviant probability distribution Pr (n j ) (26b) for junctions must be postponed too. This is simply because its normalizing function W Φ combines together the difficulties inherited from the normalizing functions W and Φ while, for the least, a fast numerical evaluation for the former has yet to be found. The good news is that a very efficient numerical evaluation already exists for the latter. In fact, this numerical evaluation was presented with the Lerch transcendent function as illustration (Aksenov et al., 2003). Formally, it consists to apply to the series a condensation transformation followed by a Levin d-transformation (Brezinski and Redivo Zaglia, 1991;Press et al., 2007). Technically, its adaptation to the generalized Johnson-Lerch transcendent function Φ is straightforward. In practice, a careful implementation written in C language that uses the Levin transformation encoded in the HURRY procedure (Fessler et al., 1983, Algo. 602) as implemented in the GNU Scientific Library (Galassi et al., 2009) appears efficient in terms of both precision and speed.

Conclusions and Future Works
The primary goal of our investigation is to understand the statistics of urban street networks. The objective of this research was two-fold. First, see how our recent results on self-organized urban street networks can be broaden to 'nearly' selforganized urban street networks. Second, learn what we can expect from this extension of our initial domain of investigation. The implicit idea behind this approach is that most urban street networks can be envisaged as a perturbation of a self-organized urban street network.
To start, we present the surprisal statistical physics model that we showed to govern self-organized urban street networks. Afterwards, by hand, we perturb the model by introducing a surprisal drift. We argue that the surprisal drift essentially results from artificial remodellings imposed by urban designers or decision makers. We obtain the generic statistics for arbitrarily drifted self-organized urban street networks, and most importantly, a practical statistics for the slightly drifted ones among them. All along we learn two important and practical properties. First, as expected whenever any perturbation occurs, surprisal drift perturbations lead to a separation of parameters. Here, perturbations separate the macroscopic evolution scale parameter from the mesoscopic social interaction parameters. Second, data analysis for validating the practical statistics for slightly drifted self-organized urban street networks remains manageable -modulo some numerical analysis efforts.
Future works must first and foremost validate the practical statistic for slightly drifted self-organized urban street networks for a sufficiently large bunch of 'real' urban street networks. Thereafter, the macroscopic and mesoscopic parameters must be estimated for a representative set of slightly drifted self-organized urban street networks in view to perform an observational qualitative investigation on the involved phenomena. This investigation is helpful to determine whether or not the macroscopic phenomena of evolution and the mesoscopic phenomena of social interactions transcend cultural basins. There exists evidences that the latters are cultural dependent. We believe, in contrast, that the macroscopic phenomena of evolution is characterized by an universal constant evolution scale that might reflect either spacial spanning, unconscious processes, or both; if so, an observational estimation must be isolated and ultimately some rational must be found. This sequence of observational investigations aims to confirm for self-organized urban street networks the status of reference among urban street networks. Once confirmed, it makes sense to compare the strengths of the surprisal drifts among a representative set of urban street networks. Because these strengths reflect the rational thoughts of urban designers and decision makers, we expect to observe a random (not to say irrational) set of data. On the other hand, since surprisal drifts might be perceived as a source of stress by city-dwellers, these strengths might have correlation with data that mark their disengagement for their own urban street networks. If such correlations are effectively observed, then the strength of surprisal drifts might be interpreted as a indicator of their disengagement, namely, as a quality measure of their city.

Availability of data and materials
The datasets generated and analysed during the current study are available from the corresponding author on reasonable request.