The Search for the Ideal Town to Live in Italy
I am on a quest to find the ideal town to call home in Italy. The concept of an “ideal town” is subjective, but the journey allows me to learn about the socioeconomic and geographical factors of my new country. With countless towns to consider, I needed to narrow down the list of options. While an objective ranking cannot provide the ultimate answer, it can help me define my preferences and simplify the decision-making process. To this end, I gathered data from Il Sole 24 Ore, an Italian newspaper, to classify Italian provinces.

Our current home, Bologna is great, but does not meet some of our needs. Last fall, my family and I made the move from San Francisco to Bologna in search of a new home closer to our families in Turkey and a chance to immerse ourselves in Italian culture. Bologna was our first choice because it is centrally located in North Italy, offers easy connections globally, including Turkey, and has a large enough English-speaking community to help ease the culture shock for our children. We planned to either settle down in Bologna or at least enjoy it while we planned our next step. While Bologna met most of our expectations, it also fell short in some areas. For example, the air pollution in the Po Valley was more significant than I had anticipated. Although Bologna is at the border of the valley, the air quality is quite poor. The city is lively, but also crowded and noisy, and it lacks the outdoor opportunities for kids that we took for granted in San Francisco. PM10 concentrations in the Po Valley on 18th December, 2021.
We have begun to think about our next move and plan to take a more informed approach this time. However, with so many beautiful towns in Italy, we need to narrow down our options. I began searching for objective metrics to help us make our decision. To my surprise, Bologna ranks first among all 107 provinces in Italy, according to a ranking by the Italian newspaper Il Sole 24 Ore. The ranking is based on 90 criteria that cover a wide range of areas, such as crime rate, home prices, length of cycling paths, access to health specialists, and green spaces. I do believe Bologna is a great place, but it being the top-ranking province in Italy? That was surprising. Taking a closer look at the methodology revealed why our perception and the ranking were not aligned: In its defense, Il Sole 24 Ore’s ranking is quite comprehensive and they developed a great site where you can dive into the individual dimensions and see rankings for each separate indicator. You can check it out yourself. Even if you don’t understand Italian, Google Translate does a good job. One issue with the overall ranking system is that it averages all indicators, without considering how they might overlap in measuring the same latent factors. The goal is to provide one general-purpose ranking using all the information available. For example, halving the crime rate is not necessarily as significant as doubling the length of bike paths in a region, but in this ranking it is. While this method of ranking may work for general purposes, it doesn’t consider the specific needs, values, and expectations of our family. Therefore, I wanted to use the data provided by Il Sole 24 Ore and create a new ranking that better aligns with our preferences. As a first step, I reviewed the 90 indicators and selected the ones that are most relevant to our family’s needs and values. I used the following four dimensions as a guide for determining which indicators to include: Social capital: I looked for indicators that reflect the level of social capital in an area along with the ease of integration for newcomers. It’s notoriously difficult to measure social capital directly in an area or community, but the literature suggests some proxies such as education, newspaper readership, organizational memberships, and participation in civic life. I selected overall education level, turnout in elections, percentage of immigrants living there, and the percentage of the population that is within our desired age range. I also included metrics related to disconnected youth (which I reverse-coded by using the percentage of “connected” youth) and the number of civil cases per 100,000 residents (which I also reverse-coded by converting it to the number of residents per civil case). Material well-being: I wanted to find out if the area has an efficient local government, a reliable public transportation system, a healthy population, easy access to public services, etc. These can be mostly captured by wealth and material development. I selected life expectancy at birth, a composite Urban Ecosystem score based on public transportation, garbage collection, etc., bike paths (km per person), green areas in the urban areas, a composite quality of life of women index based on 12 indicators such as gender equality in STEM, occupations and participation in sports. Safety: Coming from San Francisco, we are particularly sensitive about property crime and would love to live somewhere we don’t have to lock our bikes with the strongest steel chain money can afford even if we leave it for two minutes outside the market. I selected the overall criminality index, snatch thefts, and street robberies. Outdoors and sports: This is a part of life we care a lot about and our new home has to provide decent opportunities. I looked for indicators related to air quality, climate, and opportunities for outdoor activities.
Some other uncategorized indicators that I selected were composite scores for the quality of life for the elderly, young, and babies, housing prices, number of cafes, bookstores, and restaurants per person. I made some changes to the data to make it easier to compare different provinces in Italy. For example, I flipped the numbers for house prices and crime rates to make higher numbers mean better outcomes. I also converted all the scores to standardized units to make them easier to compare. This way, I can see which provinces are doing better or worse in different areas, like crime or housing prices. To make the data easier to interpret, I used a method called exploratory factor analysis. This technique helps to identify patterns in a large set of data and condenses it into a smaller set of factors. For example, a factor that represents Quality of Life can be inferred from measurements like life expectancy and gender equality. This allows me to focus on the latent factors that drive the observations, without worrying about the overlap and redundancy across different indices. Based on the correlation structure in my data, I identified four latent factors that could be used to explain the observed measurements. I will provisionally call them Factor A, B, C, and D, and will go over them one by one. Factor A - North and South The primary factor that explained the variations across provinces is a measure of wealth and economic development in an area. It’s positively associated with the quality of life for women, connected youth, high election turnouts, long life expectancy, and a strong urban ecosystem. However, it’s negatively associated with civil cases per capita, climate index, and number of days without rain. The negative correlation with two exogenous variables (rain and climate) suggests that there is a strong spatial component to Factor A because human activity is unlikely to have an effect at these scales. Factor A is also negatively correlated with air quality, which is consistent with the idea that wealth comes with industrial activity that can affect air quality. In areas with high Factor A, house prices are also high. Table: Indicators positively associated with Factor A. Indicators marked with * have flipped signs.
| Indicator | FactorA |
|---|---|
| Quality of life for women | 0.88 |
| *Disconnected youth | 0.86 |
| Electoral participation | 0.84 |
| Newspaper readership | 0.79 |
| *Litigation index | 0.75 |
| Kids | 0.74 |
| Life expectancy at birth | 0.73 |
| Regular resident immigrants | 0.72 |
| Urban ecosystem | 0.59 |
| Fitness, pools, well-being | 0.59 |
| Sports and outdoors | 0.57 |
| Bike paths | 0.57 |
| Urban green areas | 0.54 |
| People with at least high school degree | 0.53 |
| Table: Indicators negatively associated with Factor A. Indicators marked with * have flipped signs. |
| Indicator | FactorA |
|---|---|
| Consecutive days without rain | -0.73 |
| General practitioners | -0.61 |
| *Average house price | -0.55 |
| Climate | -0.52 |
| Air quality | -0.51 |
| Overall, I was pretty sure Factor A reflected the infamous north and south divide, and I was not mistaken. Here is the average Factor A score each province gets: | |
| The above map is visually simple, but to explain it one would need to consider complex historical, sociological, and economic factors beyond the scope of this report. | |
| Unfortunately, things associated with the development and urban life’s downsides tend to be correlated. Thus, Factor A is not a pure measure of the desirable qualities, but a mix. For example, between two areas that have the same scores in all dimensions except air quality, the one with lower air quality will have a higher Factor A score. This is because the EFA doesn’t differentiate between “good” and “bad” indicators - it estimates a factor that explains the data variance. Air quality contributes negatively to Factor A. | |
| The full ranked list of Italian provinces according to Factor A can be found below, and Bologna ranks 40th. | |
| Pos. | Province |
| —-: | :——————— |
| 1 | Treviso |
| 2 | Cremona |
| 3 | Vicenza |
| 4 | Lecco |
| 5 | Bergamo |
| 6 | Pordenone |
| 7 | Belluno |
| 8 | Lodi |
| 9 | Sondrio |
| 10 | Cuneo |
| 11 | Ravenna |
| 12 | Mantova |
| 13 | Monza e Brianza |
| 14 | Como |
| 15 | Verona |
| 16 | Reggio Emilia |
| 17 | Udine |
| 18 | Brescia |
| 19 | Piacenza |
| 20 | Padova |
| 21 | Biella |
| 22 | Parma |
| 23 | Varese |
| 24 | Ferrara |
| 25 | Modena |
| 26 | Forlì-Cesena |
| 27 | Prato |
| 28 | Verbano-Cusio-Ossola |
| 29 | Bolzano |
| 30 | Arezzo |
| 31 | Trento |
| 32 | Novara |
| 33 | Pavia |
| 34 | Alessandria |
| 35 | Siena |
| 36 | Pesaro e Urbino |
| 37 | Rovigo |
| 38 | Aosta |
| 39 | Firenze |
| 40 | Bologna |
| 41 | Venezia |
| 42 | Vercelli |
| 43 | Fermo |
| 44 | Pisa |
| 45 | Savona |
| 46 | Asti |
| 47 | Pistoia |
| 48 | Ancona |
| 49 | Rimini |
| 50 | Gorizia |
| 51 | Grosseto |
| 52 | Perugia |
| 53 | Milano |
| 54 | Lucca |
| 55 | Torino |
| 56 | Macerata |
| 57 | Livorno |
| 58 | Trieste |
| 59 | Terni |
| 60 | Ascoli Piceno |
| 61 | Massa-Carrara |
| 62 | La Spezia |
| 63 | Rieti |
| 64 | Genova |
| 65 | Oristano |
| 66 | Cagliari |
| 67 | Imperia |
| 68 | Teramo |
| 69 | Viterbo |
| 70 | L’Aquila |
| 71 | Chieti |
| 72 | Nuoro |
| 73 | Roma |
| 74 | Frosinone |
| 75 | Benevento |
| 76 | Matera |
| 77 | Sassari |
| 78 | Pescara |
| 79 | Potenza |
| 80 | Sud Sardegna |
| 81 | Latina |
| 82 | Lecce |
| 83 | Bari |
| 84 | Avellino |
| 85 | Campobasso |
| 86 | Enna |
| 87 | Brindisi |
| 88 | Isernia |
| 89 | Barletta-Andria-Trani |
| 90 | Ragusa |
| 91 | Vibo Valentia |
| 92 | Caserta |
| 93 | Cosenza |
| 94 | Agrigento |
| 95 | Taranto |
| 96 | Foggia |
| 97 | Salerno |
| 98 | Trapani |
| 99 | Crotone |
| 100 | Messina |
| 101 | Reggio Calabria |
| 102 | Catanzaro |
| 103 | Palermo |
| 104 | Catania |
| 105 | Caltanissetta |
| 106 | Napoli |
| 107 | Siracusa |
| Factor B: Urban versus Rural | |
| Factor B is positively associated with density and population, which indicates the difference between urban and rural areas. High Factor B scores (more “urban” areas) have higher crime rates, smaller apartments, higher fiber internet penetration, and higher housing prices. Negatively associated with Factor B are higher crime rates, less living space, higher rent-to-income ratios, and higher house prices. | |
| Table: Indicators positively associated with Factor B |
| Indicator | FactorB |
|---|---|
| Population density | 0.66 |
| High-speed internet utilization | 0.64 |
| Population | 0.62 |
| High-speed internet availability | 0.62 |
| And here are the indicators negatively associated with Factor B. Remember that in this report, all undesirable attributes are reverse-coded so that higher values mean better outcomes. So, for a reverse-coded variable such as *Crime rate, a negative association is a bad thing. It means the areas with higher values of Factor B also have higher crime. In addition to higher crime, higher values of Factor B are also associated with less living space, higher rent-to-income ratio, and higher house prices. | |
| Table: Indicators negatively associated with Factor B. Indicators marked with * have flipped signs. |
| Indicator | FactorB |
|---|---|
| *Crime index | -0.77 |
| *Snatch thefts | -0.76 |
| Living space | -0.70 |
| *Street robberies | -0.70 |
| *Rent/income ratio | -0.67 |
| *Average house price | -0.59 |
| While Factor B captured a real-world phenomenon, it is not useful for finding the ideal Italian town since it’s difficult to find a place with the benefits of urban areas (fiber internet) without the downsides (high crime). | |
| Looking at the scatter plot of Factor A and Factor B reveals an interesting pattern of Italian provinces. The horizontal axis is the value of Factor A, which for our purposes can be interpreted as North and South. The vertical axis is Factor B, which we can interpret as the urban/rural distinction. In the South, large metropolitan centers such as Napoli, Palermo, and Catania are extreme points. In the North, Milano, Firenze, Bologna, and Rimini reflect similar urban scores. | |
| Pos. | Province |
| —-: | :——————— |
| 1 | Roma |
| 2 | Napoli |
| 3 | Milano |
| 4 | Genova |
| 5 | Firenze |
| 6 | Torino |
| 7 | Bologna |
| 8 | Rimini |
| 9 | Prato |
| 10 | Bari |
| 11 | Palermo |
| 12 | Trieste |
| 13 | Venezia |
| 14 | Catania |
| 15 | Imperia |
| 16 | Livorno |
| 17 | Pescara |
| 18 | Pisa |
| 19 | Parma |
| 20 | Barletta-Andria-Trani |
| 21 | Foggia |
| 22 | Bergamo |
| 23 | Padova |
| 24 | Varese |
| 25 | Verona |
| 26 | Salerno |
| 27 | Monza e Brianza |
| 28 | Brescia |
| 29 | Modena |
| 30 | Forlì-Cesena |
| 31 | Cagliari |
| 32 | Pavia |
| 33 | Latina |
| 34 | Ravenna |
| 35 | Messina |
| 36 | Lucca |
| 37 | Taranto |
| 38 | Perugia |
| 39 | Savona |
| 40 | Sassari |
| 41 | Siracusa |
| 42 | Grosseto |
| 43 | Pistoia |
| 44 | Reggio Emilia |
| 45 | Ferrara |
| 46 | Massa-Carrara |
| 47 | Como |
| 48 | Caserta |
| 49 | Piacenza |
| 50 | La Spezia |
| 51 | Lecce |
| 52 | Ragusa |
| 53 | Novara |
| 54 | Trapani |
| 55 | Ancona |
| 56 | Bolzano |
| 57 | Arezzo |
| 58 | Fermo |
| 59 | Brindisi |
| 60 | Mantova |
| 61 | Lecco |
| 62 | Vicenza |
| 63 | Catanzaro |
| 64 | Terni |
| 65 | Pesaro e Urbino |
| 66 | Teramo |
| 67 | Alessandria |
| 68 | Sud Sardegna |
| 69 | Lodi |
| 70 | Treviso |
| 71 | Reggio Calabria |
| 72 | Cremona |
| 73 | Siena |
| 74 | Caltanissetta |
| 75 | Trento |
| 76 | Crotone |
| 77 | Rovigo |
| 78 | Chieti |
| 79 | Aosta |
| 80 | Ascoli Piceno |
| 81 | Vercelli |
| 82 | Asti |
| 83 | Agrigento |
| 84 | Gorizia |
| 85 | Viterbo |
| 86 | Udine |
| 87 | Matera |
| 88 | Biella |
| 89 | Macerata |
| 90 | Cuneo |
| 91 | L’Aquila |
| 92 | Cosenza |
| 93 | Nuoro |
| 94 | Benevento |
| 95 | Campobasso |
| 96 | Pordenone |
| 97 | Verbano-Cusio-Ossola |
| 98 | Isernia |
| 99 | Frosinone |
| 100 | Rieti |
| 101 | Avellino |
| 102 | Sondrio |
| 103 | Vibo Valentia |
| 104 | Enna |
| 105 | Potenza |
| 106 | Oristano |
| 107 | Belluno |
| Factor C: Demographics? Retirees? College students? | |
| I’ve been unable to interpret this factor. It seems to be related to demographical dynamics and the age of the population. It is positively associated with the number of cafes and restaurants per person. It is negatively associated with the birth rate and ratio of people in our age cohorts (kids between 0–14 and adults between 30–50). | |
| It is difficult to use this factor directly to rank provinces. We are looking for an area that is both high in the number of cafes and restaurants, and the ratio of our age peer groups among the population. | |
| Table: Indicators positively associated with Factor C |
| Indicator | FactorC |
|---|---|
| Cafes | 0.64 |
| Restaurants | 0.58 |
| Indicators positively associated with Factor C: | |
| Table: Indicators negatively associated with Factor C |
| Indicator | FactorC |
|---|---|
| Ratio of 0-14 ages | -0.91 |
| Birth rate | -0.89 |
| Ratio of 30-50 ages | -0.53 |
| The highest-scoring areas were Massa-Carrara, Savona, Genova, Grossetto, Imperia, and South Sardinia. Lowest-scoring areas were Bolzano, Trento, Lodi, Caserta, Crotone, and Reggio Emilia. I am inclined to interpret this as how dynamic the population is in an area with higher Factor C values corresponding to an older, stagnant population that enjoys bars and coffees and low values indicating a younger population with more kids. | |
| Pos. | Province |
| —-: | :——————— |
| 1 | Savona |
| 2 | Genova |
| 3 | Massa-Carrara |
| 4 | Grosseto |
| 5 | Imperia |
| 6 | Sud Sardegna |
| 7 | Cagliari |
| 8 | Trieste |
| 9 | Sassari |
| 10 | Livorno |
| 11 | Ferrara |
| 12 | Biella |
| 13 | Rovigo |
| 14 | Alessandria |
| 15 | La Spezia |
| 16 | Nuoro |
| 17 | Terni |
| 18 | Oristano |
| 19 | Campobasso |
| 20 | Lucca |
| 21 | Verbano-Cusio-Ossola |
| 22 | Isernia |
| 23 | Rieti |
| 24 | Ascoli Piceno |
| 25 | Udine |
| 26 | Vercelli |
| 27 | Gorizia |
| 28 | Firenze |
| 29 | L’Aquila |
| 30 | Venezia |
| 31 | Torino |
| 32 | Rimini |
| 33 | Viterbo |
| 34 | Fermo |
| 35 | Lecce |
| 36 | Potenza |
| 37 | Brindisi |
| 38 | Belluno |
| 39 | Pistoia |
| 40 | Perugia |
| 41 | Asti |
| 42 | Roma |
| 43 | Chieti |
| 44 | Teramo |
| 45 | Siena |
| 46 | Ancona |
| 47 | Messina |
| 48 | Pavia |
| 49 | Arezzo |
| 50 | Ravenna |
| 51 | Pisa |
| 52 | Aosta |
| 53 | Bologna |
| 54 | Taranto |
| 55 | Pescara |
| 56 | Benevento |
| 57 | Bari |
| 58 | Pesaro e Urbino |
| 59 | Piacenza |
| 60 | Avellino |
| 61 | Matera |
| 62 | Forlì-Cesena |
| 63 | Milano |
| 64 | Macerata |
| 65 | Prato |
| 66 | Novara |
| 67 | Frosinone |
| 68 | Trapani |
| 69 | Catanzaro |
| 70 | Enna |
| 71 | Padova |
| 72 | Foggia |
| 73 | Cosenza |
| 74 | Salerno |
| 75 | Latina |
| 76 | Varese |
| 77 | Lecco |
| 78 | Cremona |
| 79 | Caltanissetta |
| 80 | Como |
| 81 | Siracusa |
| 82 | Parma |
| 83 | Mantova |
| 84 | Agrigento |
| 85 | Vibo Valentia |
| 86 | Sondrio |
| 87 | Napoli |
| 88 | Barletta-Andria-Trani |
| 89 | Reggio Calabria |
| 90 | Modena |
| 91 | Verona |
| 92 | Pordenone |
| 93 | Palermo |
| 94 | Monza e Brianza |
| 95 | Vicenza |
| 96 | Brescia |
| 97 | Catania |
| 98 | Treviso |
| 99 | Cuneo |
| 100 | Bergamo |
| 101 | Ragusa |
| 102 | Reggio Emilia |
| 103 | Crotone |
| 104 | Caserta |
| 105 | Lodi |
| 106 | Trento |
| 107 | Bolzano |
| Factor D: Education | |
| Factor D is positively associated with years spent in education, % of people with at least a high school degree, and % of people with university degrees or trade schools and colleges. None of the indicators in my dataset had a negative substantial loading on Factor D. | |
| Table: Indicators positively associated with Factor D |
| Indicator | FactorD |
|---|---|
| Years of study | 0.82 |
| People with at least high school degree | 0.81 |
| Post secondary graduates | 0.63 |
| Pos. | Province |
| —-: | :——————— |
| 1 | L’Aquila |
| 2 | Pescara |
| 3 | Roma |
| 4 | Ancona |
| 5 | La Spezia |
| 6 | Avellino |
| 7 | Trento |
| 8 | Genova |
| 9 | Bologna |
| 10 | Trieste |
| 11 | Perugia |
| 12 | Frosinone |
| 13 | Milano |
| 14 | Verona |
| 15 | Rieti |
| 16 | Isernia |
| 17 | Gorizia |
| 18 | Monza e Brianza |
| 19 | Pisa |
| 20 | Chieti |
| 21 | Terni |
| 22 | Matera |
| 23 | Firenze |
| 24 | Modena |
| 25 | Bolzano |
| 26 | Siracusa |
| 27 | Campobasso |
| 28 | Udine |
| 29 | Cosenza |
| 30 | Salerno |
| 31 | Cagliari |
| 32 | Parma |
| 33 | Ascoli Piceno |
| 34 | Potenza |
| 35 | Reggio Emilia |
| 36 | Viterbo |
| 37 | Siena |
| 38 | Torino |
| 39 | Vibo Valentia |
| 40 | Pesaro e Urbino |
| 41 | Belluno |
| 42 | Padova |
| 43 | Teramo |
| 44 | Pordenone |
| 45 | Macerata |
| 46 | Caserta |
| 47 | Ravenna |
| 48 | Latina |
| 49 | Savona |
| 50 | Aosta |
| 51 | Livorno |
| 52 | Rimini |
| 53 | Bari |
| 54 | Agrigento |
| 55 | Como |
| 56 | Massa-Carrara |
| 57 | Lucca |
| 58 | Reggio Calabria |
| 59 | Sondrio |
| 60 | Varese |
| 61 | Forlì-Cesena |
| 62 | Palermo |
| 63 | Ferrara |
| 64 | Catanzaro |
| 65 | Enna |
| 66 | Venezia |
| 67 | Pavia |
| 68 | Vicenza |
| 69 | Lecco |
| 70 | Benevento |
| 71 | Asti |
| 72 | Piacenza |
| 73 | Catania |
| 74 | Vercelli |
| 75 | Verbano-Cusio-Ossola |
| 76 | Novara |
| 77 | Ragusa |
| 78 | Arezzo |
| 79 | Lodi |
| 80 | Pistoia |
| 81 | Brescia |
| 82 | Treviso |
| 83 | Messina |
| 84 | Alessandria |
| 85 | Cuneo |
| 86 | Brindisi |
| 87 | Crotone |
| 88 | Napoli |
| 89 | Grosseto |
| 90 | Caltanissetta |
| 91 | Biella |
| 92 | Cremona |
| 93 | Trapani |
| 94 | Mantova |
| 95 | Sud Sardegna |
| 96 | Taranto |
| 97 | Barletta-Andria-Trani |
| 98 | Foggia |
| 99 | Lecce |
| 100 | Sassari |
| 101 | Nuoro |
| 102 | Bergamo |
| 103 | Fermo |
| 104 | Imperia |
| 105 | Rovigo |
| 106 | Oristano |
| 107 | Prato |
| Overall, Bologna ranks |
- 40th out of 107 in Factor A (material well-being)
- 7th out of 107 in Factor B (negative urbanism)
- 53rd out of 107 in Factor C (retirees and college students?)
- 9th out of 107 in Factor D (education) This is a far cry from the top place in Il Sole 24 Ore’s ranking. However, because the factors themselves are mixed-valued, I cannot use them as is for ranking provinces. It’s time to introduce our values and priors to the ranking. Our Personal Ranking While exploratory factor analysis helped me explore the data, the factors it produced were positively correlated with both good and bad indicators. EFA can’t generate “pure” factors that align with our personal values, which makes it difficult to make trade-offs between competing factors that determine livability in a town for me. Based on the previous analysis, I created 6 sub-dimensions that capture what we care about in a province. Each sub-dimension includes indicators that are positively correlated with each other, allowing me to create sub-indices that directly reflect our priorities. Wealth / Material quality of life Quality of life for women, *Disconnected youth (reverse coded as connected youth) Electoral participation *Litigation index (reverse coded) Newspaper readership Kids Life expectancy at birth Regular resident immigrants Bike paths Urban ecosystem Fitness, pools, well-being Urban green areas
Sports and Outdoors: Scores for regional opportunities and infrastructure for Swimming Nature tourism Cycling Winter sports
Safety *Snatch thefts (reverse coded) *Crime index (reverse coded) *Street robberies (reverse coded)
Street-life Number of restaurants per resident Number of cafes per resident Number of bookstores per resident
Peers Population (log-scaled, if everything else is equal we don’t mind more people around) % of the population between ages 0 and 14 % of the population between ages 30 and 50
Education Average years spent in the educational system % population with at least a high school degree % of people with post-secondary (college, trade schools, etc.) education
A very important dimension that is missing here is air quality and the weather (e.g., average sunlight received in winter, the number of days that are too hot or too cold). These metrics are very location specific and can change from one town to another even in the same province, especially in mountainous areas. The dataset I have only reports air quality for the capital town of each province and I am not satisfied with its quality. In any case, once I determine a short list of towns to check out it will be an easy exercise to find relevant data for these qualities and I don’t have to track down a reliable air quality dataset for all cities in Italy. I converted all of the above individual indicators to z-scores (representing how many standard deviations a province is away from the national average) and took the average within each sub-dimension, giving me 6 scores for each province. The correlation table reveals that some indices are negatively correlated (e.g., Safety and Peers, Street Life and Peers), suggesting a trade-off between competing goals. Others are positively correlated (Education, Outdoors and Sports, and Quality of Life), indicating that they might not be differentiated enough and could be capturing a common factor. Nonetheless, the correlations are neither too high nor too low, giving me confidence that these six dimensions are not excessively redundant. Considering our preferences, the three dimensions - Wealth, Sports/Outdoors, and Safety - hold the most significance for us when ranking towns. We believe that higher material quality of life, better access to sports and outdoor opportunities, and lower crime rates are always desirable, regardless of their current levels. On the other hand, the remaining three dimensions - Education, Street Life, and Peers - offer diminishing returns. For instance, we don’t mind if the percentage of our age peers is 60% or 70%, as long as it’s not as low as 20%. Regarding safety, however, I was concerned that I was over-indexing on this dimension. I tried to compare San Francisco and Bologna (two cities we are familiar with) to calibrate the relative safety scores of Italy with our perceived safety in the States. Turns out Bologna, even though it’s among less safe provinces in Italy, is actually comparable to or safer than San Francisco with respect to burglary and murder. Bologna had less than half of the burglary rate in San Francisco (321 burglaries per 100K residents versus 722 per 100K residents) and almost a tenth of SF’s murder rates (0.7 murders per 100K versus 7 murders per 100K). Even allowing for wide discrepancies in specific crime type definitions and reporting rates, it’s clear San Francisco would rank as one of the least safe cities had it been placed in Italy. So I decided not to put too much weight on Safety either. As a result of these considerations, I decided to account for our non-linear preferences, by eliminating the bottom quintile of provinces in each of the 6 sub-dimensions first, and then ranking them according to the average score over the most salient 2 sub-dimensions (Quality of Life and Sports and Outdoors). The initial filter left me with 26 provinces that do not rank among the worst quintile in either of the sub-dimensions. The top 15-ranking provinces according to the average Quality of Life and Sports and Outdoors (and the quintile groups they are in):
| Province | Average Score | QualityOfLife | OutdoorsAndSports |
|---|---|---|---|
| Sondrio | 1.76 | 1 | 1 |
| Trento | 1.69 | 2 | 1 |
| Aosta | 1.51 | 3 | 1 |
| Verona | 1.28 | 1 | 1 |
| Udine | 1.04 | 1 | 1 |
| Brescia | 0.89 | 2 | 1 |
| Cremona | 0.58 | 1 | 2 |
| Pesaro e Urbino | 0.47 | 2 | 2 |
| Ancona | 0.38 | 2 | 2 |
| Piacenza | 0.38 | 2 | 2 |
| Cagliari | 0.28 | 3 | 2 |
| Pistoia | 0.24 | 3 | 2 |
| Siena | 0.14 | 2 | 3 |
| Ascoli Piceno | 0.11 | 3 | 2 |
| Lucca | 0.09 | 3 | 2 |
| Three regions account for 9 of the top 15 provinces: | |||
| Lombardy (Sondria, Cremona, Brescia) | |||
| Tuscany (Pistoia, Siena, Lucca) | |||
| Marche (Pesaro e Urbino, Ancona, Ascoli Piceno). | |||
| The remaining 6 positions are shared by Trentino-Alto Adige (Trento), Aosta Valley (Aosta), Veneto (Verona), Friuli-Venezia Giulia (Udine), Emilia-Romagna (Piacenza), and Sardinia (Cagliari). |
The top 3 positions (Sondrio, Trento, and Aosta) are northern and Alpine provinces that offer many outdoor opportunities and the highest quality of life scores. Trento ranking the 2nd is a strong confirmatory evidence for me that the ranking is working as expected, as I lived in Trentino-Alto Adige while doing my Ph.D. and it’s one of those rare paradise-like places on Earth where you can do road biking, mountain biking, skiing, hiking, and wind-surfing all within less than one-hour driving distance from your home. Unfortunately, it is too far away from major airports which I do not include as a dimension yet - a common aspect it shares with Sondrio and Aosta. A photo from a biking trip I took in Alto Adige in 2019.A surprising finding was the region of Marche which is represented by three provinces in the top-15: Pesaro e Urbino, Ancona, and Ascoli Piceno. We’ve never been to Marche before and never thought about it as a potential destination. My next step will be to read about and plan some trips to Marche. Another finding that made me reconsider some of my positions was Siena and Verona. Instinctively, I tend to discard touristic places as serious candidates, but Siena seems to have relatively high scores with Safety being in the top quintile and may deserve serious consideration. The same goes for Verona. Conclusion I have confidence my rankings work. The one province where I would spend the rest of my life if travel logistics was not a constraint is ranking 2nd. Bologna would not be among the candidates if we relied only on this analysis. Mainly because it couldn’t make the cut for the Safety dimension. I am still on the fence about this. It will be difficult to leave the trauma of our bikes stolen in San Francisco behind and I’ll keep thinking about Safety for a while. Verona and Siena, while being touristic, may deserve a second chance. Marche seems like an interesting region that I need to look into more. My next step will be to expand this list to actual Italian towns (Comune) that are located in these provinces and look at the driving times to major airports that we want to stay nearby.
source: