Vol. 4 Issue 1
February 2009

Once upon a time, in a land far, far away, gigantic, multicolored butterflies could befriend sea monsters living in the warm waters off the coast of a magical island. The golden light of an ever-shining sun would reflect the magnificent colors of the butterflies' wings and create iridescent rainbows ... WAIT!! WHAT?? This doesn't sound like a software story. No, we haven't been hallucinating, reading Kubla Khan or listening to Pink Floyd. But we have been building a database for Philadelphia's spectacular Mural Arts Program, generating psychedelic cost distance maps for a new research project and juxtaposing historic maps with contemporary research on the AfricaMap project. Welcome to another edition of the Azavea Journal!

AfricaMap: Azavea and Harvard Created a Web-based Search Tool for Exploration of a Historically Significant Collection of Maps of Africa

"Harvard University Geospatial Infrastructure (HUG) [was designed to] bring many... unconnected Africa datasets together in a single, easy-to-access web application that would promote collaboration and enable researchers to learn from other areas of study."

The most powerful tools often begin with a desire to solve simple, everyday problems. At Harvard University, faculty, students, and researchers often found that finding maps and spatial data related to their studies of Africa was extremely difficult. The issue was not that the data didn’t exist. In fact, the Harvard Map Collection has an impressive collection of historical maps of Africa. Many researchers also develop detailed Africa datasets in the course of their work, while other important spatial data is scattered across other organizations. But these maps and datasets had to be tracked down individually, assuming the researcher was even aware of them at all.

View of AfricaMap’s 1959 Ethnographic Map Layer Including Airfields

But at Harvard University’s Center for Geographic Analysis, Professors Suzanne Blier and Peter Bol with Senior Analyst Ben Lewis saw beyond simply creating a common repository for these maps. They envisioned a solution that would bring many of these currently unconnected Africa datasets together in a single, easy-to-access web application that would promote collaboration and enable researchers to learn from other areas of study. By layering the maps on top of each other, a researcher could explore all of the data or knowledge captured in maps from various disciplines. Knowledge of an area of interest could be deepened by maps describing historical, environmental, social, linguistic, or economic data. And by creating a map of scholarly projects focused on Africa, users could discover the work of others with interest in common geographical areas, despite differences in their fields of study. The vision for the Harvard University Geospatial Infrastructure (HUG) platform was born.

Ben Lewis developed an innovative, highly scalable, spatial search and display architecture to address these ambitious goals. By utilizing open standards and protocols, the framework would interoperate in the future with other technical systems used by scholars in various disciplines. By committing to an open source toolset and codebase, the framework could be applied to other areas of the world and be shared with other organizations who could, if they wanted, use and extend the framework. With the vision in place, he approached Azavea and MetaCarta to build the application.

MetaCarta built the map tiling system, while Azavea was asked to flesh out the framework with searching capabilities and some advanced features that presented fascinating technical challenges for us. One of our goals was to enable researchers to search through millions of places (ranging from populated places to physical features) using a straightforward text search, like a Google search, and have the results highlighted on the map. That work began by building a gazetteer, which is a geographic dictionary or directory — like a yellow pages for geographic place names. The initial data source for the gazetteer was the GeoNames database, which is a free geographic database of over eight million geographical names. Users can add or edit place names online, as if it were Wikipedia for place names. GeoNames doesn’t just include populated places like cities or villages, it also includes features such as farms, streams, wells, and schools.

Detail from a historic map of Africa by cartographer Jodocus Hondius,
circa 1612. “We were particularly struck by the beautiful juxtaposition
of the old and new: seeing an image of a sea creature on the edge
of a historical map, layered on top of a modern web map.” – Josh
Marcus & Reed Lauber

After processing and filtering GeoNames into a geographical database, we added two ways to view the places on a map. We added a ‘Places’ tab where one can view places by type. Here a researcher can turn on any combination of hundreds of ‘place types’ and zoom to any area of the map, clicking on’ place features’ for information about these places. In addition we made it possible to turn on all ‘place types’ and view them along with the many other layers in the system. We also made it possible to combine place name searches with ‘place type’ queries.

But first we had to tackle a technical challenge: Africa is a huge continent and there are a lot of places to show. The gazetteer has over three hundred thousand populated places, and we needed to figure out how to efficiently display all of them on the map. We profiled two open source mapping servers, one called GeoServer and another called MapServer, for the speed of image generation, and ended up with a conundrum. We preferred GeoServer’s cartography, especially for showing overlapping areas of scholarly study. But we were able to cajole MapServer to generate a map layer of hundreds of thousands of points very quickly. In the end, while we knew it would be somewhat unorthodox to use two mapping servers in the same application, we decided to use both systems for what they were good at: GeoServer for general cartography and MapServer to show the points from the gazetteer. The technology tools were rounded out with PostGIS for storing and searching feature queries, ExtJS for user interface components, and Python to tie it all together.

Once the gazetteer and visualization were in place, we were able to leverage the power of Web Map Service (WMS), an open standard to make web requests for maps, to transform the text search into a search for the geographic points in all of the map layers. The result of a text search could include features from the gazetteer, but also other searchable layers as well, such as the layer of scholarly projects. We added functionality so that a user could search many layers at once, click on the map to “drill down” and return results about different types of features in a single interface, and a range of other tools including a “permalink” that would enable users to share a particular view of the maps and search results with other users or students. The database also includes the geographic extent of research projects that target the African continent.

Now that the first phase of the project is complete, an initial public release of AfricaMap is available online and Azavea will be working with Harvard’s Center for Geographic Analysis to expand this project to apply the same framework to the Boston Metropolitan area. Take a look at the application for yourself. We especially recommend taking a look at the historical maps. We were particularly struck by the beautiful juxtaposition of the old and new: seeing an image of a sea creature on the edge of a historical map, layered on top of a modern web map. Click here to see an early 17th century map from Jodocus Hondius, and view other historical maps in the ‘Map Layers’ tab.

Another interesting way to explore the application is to put yourself in the mindset of a researcher. Imagine you are interested in the economic development of Freetown, in Sierra Leone. The ‘About’ tab has some useful documentation regarding searches and turning off and on map layers. The very detailed basemap (Freetown 2.5k) provides a strong basis for all of your work. By comparing other basemaps (the American Sierra Leone 50k map from the 70s and the Russian 500k transportation map from the 80s) you can observe new development over time. From there, you can draw in whatever other maps that are relevant for your work — some examples would include soils, population, language areas, ethnographic regions, or turn on the projects layer and explore other scholarly projects that have focused on your area. Enjoy!

Muralfarm.org: How a Remarkable Public Art Collection is Benefiting from GIS

"Thanks to Muralfarm.org, one of the largest outdoor art galleries in the world can be explored, searched, and admired from one's own living room."

The Philadelphia Mural Arts Program is the nation’s largest mural program. Since 1984, the Mural Arts Program has created nearly 3,000 murals and works of public art in Philadelphia, which are now part of Philadelphia’s civic landscape and a source of inspiration to the thousands of residents and visitors who encounter them, earning Philadelphia international recognition as the “City of Murals.” The Mural Arts Program also engages over 100 communities each year in the transformation of neighborhoods through the mural-making process, and their free art education programs annually serve over 3,000 youth at sites throughout the city and at-risk teens through education outreach programs.

Some of the thousands of murals in the city of Philadelphia.

While the Mural Arts Program has evolved from an anti-graffiti program into a city-wide public art organization, it has become increasingly important for it to manage the myriad of information about each of the murals and the communities they serve. The Mural Arts Program needed to track and publish information about a whole range of details, including artists, assistants, sponsors, affiliated organizations, start, completion and dedication dates, maintenance activities, renovation and demolition details and (of course) the location of each mural. Organizing this type of information is not a unique concern for Mural Arts. Hundreds of organizations throughout the U.S. are managing public art programs and face similar issues. Emails about various pieces of art go back and forth within the organization, some data is stored in Excel, other data in Filemaker, photos in other systems, some of it is on the organization’s web site while much of it is not. And in the end, staff members have a hard time carrying out the mission of their organization efficiently while also effectively showcasing the full extent of their collection to the public, funders, and potential sponsors.

Sajara-powered mural search page on Muralfarm.org.

When the Mural Arts Program approached Azavea, they had a good sense of what they were looking for. And, after building PhillyHistory.org for the Philadelphia Department of Records to assist them with the management and showcasing of 100,000′s of historic photos of the city, we thought that a geographic collection like that of the Mural Arts Program would be a good fit for Sajara, our geographic digital asset management solution…And so Muralfarm.org was born.

Muralfarm.org enables the public to search for murals by artist, theme, date, keywords and of course by location and neighborhood. Visitors to Muralfarm.org can tag favorite murals, save searches, be notified when new murals are added thanks to GeoRSS feeds, and enjoy special features like viewing the murals in Google Earth and Google Maps – StreetView. At the same time, the Mural Arts Program staff can easily manage information pertaining to each mural through Sajara’s digital asset management tools. And because the whole system is web-based, artists, neighborhood residents, community groups, and Mural Arts Program staff members can leave comments, send messages, and make corrections online, thereby improving the database over time. By pairing both public access search capabilites with collection management tools, Mural Arts is able to use the Sajara platform to store critical information about its projects in a single place.

The Mural Arts Program has essentially created a huge outdoor, geographically distributed museum with the entire city as its canvas. Muralfarm.org brings more than 1,000 murals under one roof, enables the public to see them all, and creates a foundation for the program to more effectively manage information about each work of art. It’s a living, changing feast for the eyes!

By the way, have you seen our newly launched Sajara website? We think it’s quite slick, how about you? Send us your comments.

Azavea Partners with Pictometry to Offer Oblique, Geo-Referenced, Aerial Imagery

" Unlike a traditional GIS that displays information from a top-down perspective, Pictometry's map information is displayed from an angled view, or oblique perspective..."

Pictometry is a leading provider of geo-referenced aerial, oblique image libraries, and related software. If you are already using their imagery or are planning to, we are happy to announce that we are now participating in Pictometry‘s Demonstration program. The Demonstration program has provided us with Pictometry’s web software development kit (SDK) which will allow us to incorporate Pictometry’s imagery into new applications. So next time you work with us and if you are already using Pictometry, let us know … we might be able to get their imagery into your application.

Pictometry offers users oblique views of features, viewed from any
direction. The above example includes three views of the Philadelphia
Museum of Art.

Unlike a traditional GIS that displays information from a top-down perspective, Pictometry’s map information is displayed from both orthogonal and oblique angles, creating a more natural three-dimensional view enabling users to see land features and structures clearly, and in their entirety. You may have already seen examples of Pictometry if you have used the ‘Bird’s Eye’ view in Microsoft’s Live Maps.

The oblique images provide a unique perspective on features, and that perspective is extended even further in the application by enabling users to view features from different directions. There is more to do than just to look at the images, there are tools to perform normal GIS functions like pan, measure lengths, and zoom in and out. More advanced tools enable you to measure height, determine the bearing of a line, and calculate areas.

What the Heck is … Cloud Computing?

"Cloud Computing...got started several years ago with attempts to engage global networks of PC's into large-scale science problems"
From L to R: Amazon Web Services, Google AppEngine and Microsoft’s Azure Services offer some of the leading cloud computing platforms.

In the beginning, computers were devices that filled rooms and whole buildings. They slowly shrank in size until, in the 1980′s, computing underwent a revolution, bringing Apple Macs and IBM PC’s to our desks. In the 1990′s, we began to connect all those personal computers to each other using the internet, creating a global network of computers. We are now in the midst of another revolution. The current transformation is again returning computing power back to machines that fill rooms and even entire warehouses, but this time, instead of a single computer filling that space, there are thousands of them filling data centers run by new, old and unexpected companies. These new data centers are being used to create a ‘cloud’ of network-accessible services and have recently been rebranded with the latest buzzword (at Azavea, we always seek to be fully buzzword-compliant) as ‘cloud computing’.

DecisionTree geographic calculation tools running on the Amazon cloud computing
services will enable you to run high performance geographic calculators without requiring your own infrastructure.

Cloud computing has actually been around for a while. Even before the internet, networked computers that could break up many tasks into small chunks were said to be engaged in ‘distributed computing‘ or, more recently, ‘grid computing‘. Cloud computing is the same concept applied to internet-connected computers. It really got started several years ago with attempts to engage global networks of PC’s into large-scale science problems. The SETI@home project enabled people to contribute their idle PC’s computing power toward examining radio signals for evidence of extraterrestrial life. Similar projects for insight into protein folding diseases, decryption and the Large Hadron Collider for processing LHC experiments have been joined by global networks of spammers and hackers who manage thousands of compromised computers to form ‘botnets’ that are used to attack government computer systems or blackmail companies.

Aside from those bent on curing cancer or instigating global mayhem, contemporary cloud computing efforts are frequently aimed at more modest objectives. Amazon.com, the retailer, is one of the leaders in this field. What began as a way for Amazon to sell unused capacity in its data centers, Amazon Web Services (AWS) is now an entire suite of reusable services being leveraged for all sorts of activities that have nothing to do with selling books and movies. The AWS Simple Storage Service (S3) is an online data storage service. The AWS Elastic Compute Cloud (EC2) enables software developers to create ‘virtual’ computers running Linux or Windows that can be applied to any computing task. Other AWS services include credit card transactions, message queues, web search, and order fulfillment. AWS has been joined by similar services at Google and a new Microsoft effort called Azure.

Many cloud computing providers provide dashboards displaying system availability.

Now imagine you are a small company that has a new idea that will require lots of computer servers. Before AWS and other services, you would have purchased your own servers and built a data center. Now, you can skip all that hassle by hosting your new idea on an infrastructure maintained at a much lower cost by Amazon, Google or Microsoft. These services are priced like your electricity and gas — you pay by the unit of storage, computing time or other metric. So as you need more capacity, you fire up another virtual server, but you only pay for what you use.

So what does cloud computing mean for geospatial services? Cloud-based geospatial services are already common. The API’s for GoogleMaps, Yahoo!Maps, Microsoft Virtual Earth, and ESRI ArcGIS Online systems already provide some basic map display, geocoding, routing and other geospatial information services as hosted services. While none of these are based on the metered pricing that Amazon offers, I’m confident this type of business model is coming. A new company, Cloudmade, is focused on creating commercial services that leverage the OpenStreetMap database.

At Azavea, our cloud computing work has focused on two of our services: Cicero and DecisionTree. To learn more about Dave Felcan’s research project on the AWS Elastic Compute Cloud (EC2), read his article below.

Research: The Amazon Elastic Cloud

"I am exploring the use of The Elastic Compute Cloud (EC2) as a resource for some of Azavea projects already in use. DecisionTree, our geographic prioritization system, was an ideal first candidate..."

I am very excited about my Azavea research project on the Amazon Elastic Compute Cloud (Amazon EC2), a technology from Amazon Inc. that is shifting a lot of people’s ideas about what computing is and can do. Amazon EC2 has arisen through the confluence of technological innovations of the past few years.

First some background. One of the most basic pieces of infrastructure in the World Wide Web today is the ubiquitous entity known as “The Server”. This term is used for a computer which performs some task or tasks on behalf of other computers. For example, web pages come from a web “server”, which sends web pages to your computer for you to see. Moreover this web server may in turn query other servers to complete this request — contact a database server to get data or geospatial server to produce a map image for example.

The idea behind a computing “cloud” (and there are others — as referenced in Robert’s ‘What the Heck is…” article above) is a bunch of computers accessible from the internet which “instantiate” whole virtual computers — with all their associated operating systems, software, data, etc. — that can be accessed on demand. One can instantiate one of these machines, connect to it via the internet through standard remote connection protocols, and voila! your screen shows the desktop for this “computer” that behaves exactly as if it were sitting under your desk.

While for desktops, this approach is odd, for servers there can be many benefits. With a few clicks of a mouse, multiple copies of the same server can be up and running at the same time to handle increases in demand. They can be shut down again when not needed. The details and headaches of actually running and owning physical machinery are offloaded to the cloud provider. The cloud provider also provides bandwidth. Once you have a working version of a website, database, or geospatial server, it can be copied and reused — no need to start from scratch with configuration.

For my research project, I am exploring the use of The Elastic Compute Cloud (EC2) as a resource for some of Azavea projects already in use. DecisionTree, our geographic prioritization web system, was an ideal first candidate. This product requires strong computing resources and was designed from the ground up to be able to run on multiple computers. With EC2 we were able to run DecisionTree on 10 instances at once, dramatically speeding up its operations and providing a mechanism for running DecisionTree for customers who do not want to maintain their own server infrastructure.

In addition to DecisionTree, we are also experimenting with running our Cicero legislative and election data service on EC2 as well as other ways to leverage the Amazon Web Services. For example, last spring, we tested a map image ’tile cache’ service that will generate and store a set of map tiles, enabling an organization to reduce bandwidth usage and improve responsiveness of a high traffic web mapping application. While EC2 was originally limited to Linux-based software, the recent addition of Windows Server as a target platform has provided much more flexibility. Do you have ideas for how you could use Amazon Web Services for your GIS project? Let us know.

Meet David Middlecamp: Another Great Addition to the Azavea Team!

Fearless Minnesotan, David looks forward to experiencing all that Philadelphia's humid summer months have to offer.
David Middlecamp with … Monty Python. Really?

David Middlecamp recently joined Azavea as a Software Developer. He most recently worked optimizing the management of millions of professional photographs at Collages.net, and before that worked on medical software at Wolters Kluwer. He is excited to contribute to the success of each new project that comes his way, and is starting with ‘Sandstone’, a new tool to assist in the preservation of historical monuments. David just moved to Philadelphia from his entrepreneurial roots in Minnesota. Be prepared to see David outside without a jacket, since he has little sympathy for snow or cold. He enjoys Monty Python, theater, music, chess, clever ideas, and quirky countercultures.

Tags:

Azavea Newsletter Puzzle: Explore Muralfarm.org

One of Philadelphia’s many murals (and a clue to one of the puzzle questions’ answers).

We are excited to announce the launch of Muralfarm.org, a new web-based database that enables users to search through and view information about murals produced by the Philadelphia Mural Arts Program. Muralfarm.org is powered by Sajara, Azavea’s geographic digital asset management software.

Want to learn more about Philadelphia’s many murals but just don’t have the time to take the fabulous bus tours? Head to www.muralfarm.org to check out the new database and find the answers to this month’s puzzle. Send your answers to info@azavea.com. Be the first to send in all 3 correct answers and receive a $25 gift card to Barnes & Noble! An adventure AND a reward, life couldn’t get much better!

1. A large, two-part mural is located at the intersection of Aspen Street and N. 39th Street. Use the address search function on Muralfarm.org to locate the mural. What is the title of the mural?

The ‘Magic Wall’ mural. Do you know where it is located?

2. Several artists were involved in the creation of this mural including Students from the University of Pennsylvania, Angela Crafton, and Donald Gensler. Click on Angela Crafton’s name to view other murals she has created. One of her murals is entitled “Magic Wall.” At what address is this mural located?

3. Click the ‘Search this address’ button located below the address to find other nearby murals. The closest mural to “Magic Wall” is a mural entitled “A Window of Stories.” What is the ‘Theme’ assigned to this mural?