IST Home > IST Division > Data Services > Blog

Local Navigation:


Articles by Chris Hoffman

Author Email: Contact Author

 

Indianapolis Museum of Art Dashboard

Thursday, September 4th, 2008 by Chris Hoffman

I was talking this morning with Peter Cava (Data Warehouse Services Manager here at UC Berkeley) about the (potential) intersection between business intelligence systems used for administrative systems here and the kinds of data aggregation and analysis performed by research scientists and faculty working with museum collections. Afterward, I was looking at the preliminary program for the Museum Computer Network conference this fall and saw that Rob Stein at Indianapolis Museum of Art is giving a talk about a dashboard that they have developed to help measure various aspects about the museum’s performance. I can’t resist when these kinds of connections happen — they always lead in interesting directions. The IMA Dashboard is up on the web, and they should be applauded for their emphasis on transparency. I also enjoyed reading the blog post at the Powerhouse Museum in Sydney featuring an interview with Rob about the project, and this pointed me to a report written by Maxwell L. Anderson for the Getty, titled “Metrics of Success in Art Museums.”

Now it’s time to get in touch with Rob!

Research directions using aggregated museum data sets

Sunday, August 10th, 2008 by Chris Hoffman

For quite awhile now, I’ve been thinking about the value of aggregating content and information in museum collections. I think it is generally accepted now that museums and collections of many kinds need to make larger portions of their collections available online to the public, and efforts to digitize collections and webify collections data are producing wonderful results. At the same time, aside from good public relations, what’s in it for the museums and for scholarship in general? What new information or new research directions might emerge from aggregations of museum content? Not surprisingly, in natural history and biodiversity research, the power of numbers, of volume, has been recognized for a long time. Single specimens are nice as types, but in order to learn something about ecological systems and evolution, you need statistically valid numbers. In cultural heritage collections, the possibilities are less clear. Some recent work in England has been interesting though perhaps more from the perspective of studying the history of museums and even of colonialism. Museum studies are still especially interested in the individual object or the subcollection. Rather than focusing on the unique individual object or specimen, what can we learn by unlocking and aggregating content in collections? What research questions emerge? What are the limitations and the opportunities?

Here’s one idea I’ve been thinking about that would pertain to Anthropology and Archaeology collections. We could look at the combinations of material and technique across culture, time and space. We’d expect certain combinations to be visible, but I suspect we would be surprised on numerous occasions. The semantic index that supports the Phoebe A. Hearst Museum of Anthropology’s Delphi system could be an excellent source for this project. I might even revisit some of my dissertation materials. Yikes…

What would be problematic about such a study? Data quality within and across collections would be an important consideration. Would we know which objects were documented at a sufficient level of detail? Would we know which parts of the collection were studied more closely? Would we know which museum specialists were “good” at their jobs? Would we know which objects or collections had been reviewed by multiple museum specialists? The number of biases would be large and problematic. But hey, I’m an archaeologist by training. I’m used to studying a messy data set and making a large number of assumptions.

What kinds of things can be done to address these biases? We could select only sets of data that had been carefully studied, but that in and of itself will create bias. We could try to enrich the data ourselves, but the sheer scope of that effort is terrifying. That’s where crowd sourcing, tagging and annotation could come in. By getting our collections online and allowing other experts (including the public) to enrich our content, we can incrementally improve the quality of our information. Other projects are showing how this can be productively done. However, how much work has been done on assessing the quality of tagging and annotation in a setting such as this? Interestingly the CalPhotos system has been allowing reviewers to annotate and re-identify species for many years. CalPhotos then might provide a good context in which to study the results of annotation and review.

Keck Hydrowatch sensor network project featured on KQED Quest

Tuesday, July 29th, 2008 by Chris Hoffman

As reported on the UCB home page, the Keck Hydrowatch sensor network project was featured on the July 22nd edition of KQED’s science program, Quest. Collin Bode, a programmer who occasionally sits in the BSCIT office when he’s not working in the field stations, makes a few appearances in the video. Collin is working with Ginger Ogle and John Deck to develop a system for the retrieval, storage, and display of automated time-series data from the sensor network deployed for Keck Hydrowatch. They are implementing a data architecture designed by CUAHSI called the “Observations Data Model” (moving it to the open source MySQL database and extending it to handle logging and other capabilities) and developing the necessary scripts and libraries to input data from multiple sources. In the second phase of the project, a web-based system for viewing and downloading the sensor network data will be developed.

Mashed Museum wiki and group in UK

Monday, June 30th, 2008 by Chris Hoffman

Mike Ellis, frequent presenter at Museums and the Web, recently published a note on the MCN list describing a Mashed Museum day that was recently held in the UK.

————–

Dear MCN

I thought you might be interested to see a brief(ish) video I hacked
together following the MCG “Mashed Museum” day which happened on the
18th June, the day before the UK Museums on the Web Conference.

See http://blip.tv/file/1029060

Further coverage continues at www.mashedmuseum.org.uk

Cheers!

Mike

————

They’ve also set up a Google Group at http://groups.google.com/group/mashedmuseum. From here, there are several other links that might take you in some interesting directions! For instance, check out the hoard.it prototype at http://feeds.boxuk.com/museums/.

Museums and the Web 2008

Tuesday, April 15th, 2008 by Chris Hoffman

I’ve recently returned from a great conference, Museums and the Web 2008, and wanted to share some highlights on DSBLOG. MW2008 was held in Montreal, April 9-12, and was attended by several hundred people from hundreds of institutions around the world. While the community is dominated by cultural heritage institutions, there is fairly good representation from science museums and archives. Sadly there were only a few natural history institutions present. The conference web site, http://www.archimuse.com/mw2008/, includes links to the papers presented through the list of speakers. The organizers encouraged online activity, and there was a fairly good amount of blogging and activity on Twitter and flickr. http://conference.archimuse.com/ is the ongoing home for this activity.

Thematically, there was much discussion about a set of familiar issues: how to engage the community, issues of authority and attribution, and the role and voice of museums and related institutions. Web 2.0 technologies figured prominently with several interesting experiments presented. Geospatial services and technologies also received a fair amount of attention. And a fair number of people are starting to work on different ways to aggregate large amounts of information and provide useful ways to search for, browse through, and understand or interpret content. One of the highlights here was a presentation by Michael Black (Phoebe Apperson Hearst Museum of Anthropology) and Patrick Schmitz (IST-Data Services) on Delphi, the faceted browser and semantic corpus being developed for PAHMA. The combination of natural language processing, data mining, and semantic technologies is emerging as an important trend in the museum space.

Finally, I attended MW2008 in order to represent the OpenCollection project. We had a booth in the exhibits hall, right next to some of the larger vendors in the collections management systems market. The interest from this community and the response to our vision and goals was remarkable. In presentations as well as hallway conversations, the subject of open source solutions came up frequently. What makes an open source solution successful? What are the risks? What will make institution X comfortable enough to try open source solution Y? These discussions continued into the night.

All in all, Museums and the Web 2008 was an excellent conference. The level of discussion and engagement that included museum professionals and technical experts, public and private institutions and for-profit companies was very refreshing.


UC Berkeley UC Berkeley CIO Campuswide IT Service Providers
Site Map Contact Webmaster