Common Tag
Posted by admin in Uncategorized on October 7th, 2009
Common Tag is an open tagging format developed to make content more connected, discoverable and engaging. Unlike free-text tags, Common Tags are references to unique, well-defined concepts, complete with metadata and their own URLs. With Common Tag, site owners can more easily create topic hubs, cross-promote their content, and enrich their pages with free data, images and widgets
Trends in Application of Data on the Web
Posted by admin in Uncategorized on September 15th, 2009

It’s definitely not all expressed in RDF or explicitly RDF-variant formats, but it seems important to note the many new ways the web-of-DATA is evolving alongside, around, above, between and underneath the web-of-DOCUMENTS.
Micro-blogging, and Twitter in particular with its 140 character limit, is perhaps one of the most obvious examples of new areas of the web decomposing into ever smaller particles of information.
The following are reports or examples of emerging application of data on the web.
ReadWriteWeb’s Top 5 Web Trends of 2009
PriceWaterhouseCoopers – Technology Forecast Spring 09
- Spinning a data Web
- Making Semantic Web connections
- A CIO’s strategy for rethinking “messy BI”
- Traversing the Giant Global Graph
- From folksonomies to ontologies
- How the Semantic Web might improve cancer treatment
- Semantic technologies at the ecosystem level
Explaining APML: what it is & why you want it
- Filtering
- Accountability
- Privacy
- Shared Data
- You’re the boss!
- APML FAQ
Facebook Using Twitter’s @ Tagging Feature
Question: How can this approach, or any other popular tagging mechanism, be integrated with Linked Data community’s focus on URIs?
Dewey Classification as Linked Data
“We had to come up with a URI pattern for the DDC that would generate persistent identifiers for DDC concepts in a distributed environment”
Top three levels of Dewey Decimal Classification published as linked data
BBC
- Media Meets Semantic Web – How the BBC Uses DBpedia and Linked Data to Make Connections
- The BBC has announced a couple SPARQL endpoints, hosted by talis and openlink
- A social semantic BBC?
Open Government
Tricorder = iPhone + Linked-Data ?
Posted by admin in Linked Data, User_Interface on August 28th, 2009

Augmented Reality Apps May Provide New Motivation for Publishing Linked Data
Recently there have been several articles on location/Augmented-Reality based applications such as those referenced below:
The Wall Has Fallen: 3 Augmented Reality Apps Now Live in iPhone App Store
Mobile phones get cyborg vision
Mobilizy Previews Augmented Reality GPS Navigation App
RobotVision: A Bing-powered iPhone Augmented Reality Browser
Apple Developing Location-Based Home Screen for the iPhone?
and others like:
Augmented reality: iPhone 3G S killer app?
wherein one finds the following enticing quote from CNET author Scott Stein, about real-time scanning for data:
“If augmented-reality navigation apps can progress at the same feverish development pace as the rest of the App Store, the new iPhone could soon be capable of location-based feats that approach tricorder capabilities, scanning the environment in real-time for data“
Though popular location-based apps have already been demonstrating the potential of data on the web, reference to the “tricorder”, of course, invokes sci-fi style visions. Perhaps this bit of futuristic fantasy materializing so rapidly before us will help to make clearer the transformative potential of Linked-Data/Web-of-Data/Data-Web.
MARC/MODS and Automating Migration to Linked-Data Standards
Posted by admin in Library World, Linked Data, Standard on August 11th, 2009
The following was posted to To: CODE4LIB@listserv.nd.edu
on: Tue, Aug 11, 2009 at 3:23 PM
Look here for post and responses.
Hi All,
I just recently subscribed to this list and have been watching for a few days, expecting that I would do so for a while longer before jumping in. However I couldn’t help but take special note of recent posts with mention of MARCXML and MODS and discussion, at least indirectly, of how those formats “play” with “linked-data” standards. Since that is an area close to where I have been working lately, I thought I’d offer a comment and also ask for some friendly feedback.
First my comment:
Here at UC San Diego Libraries, where I work, we have been generating RDF data for a couple of years now, and more recently working with triplestores and SPARQL. We also, no surprise, have lots of MARC data, and have developed some local strategies for migrating MARC to MODS to RDF with a very local conversion scheme. In order to learn more about OWL and ontologies, and possibly to create a more generally useful/acceptable expression of our MARC/MODS data as RDF I launched into a project to convert the
Library of Congress MODS XML schema
into a formal OWL ontology. At one level this can be approached as a rather mechanical process, on the other hand, I made some adjustments to MODS predicate naming, with the intent of providing more meaning to individual MODS-based RDF triples. I won’t try to explain that further here, but if anyone has additional interest, more information is available on my effort to produce and provide validity for a MODS ontology on my blog, starting at a post entitled:
Another Step Toward Lifting Library Metadata into the Cloud
and in following posts with comments and replies from and to Bruce D’Arcus, especially regarding Bibliographic Ontology.
That’s the end of my comment.
So now my question(s), or request for feedback.
Can we identify, some generally agreed on automateable strategy for converting MARC/MODS to RDF (without having to limit to Dublin Core). Or, in case I’m missing something, what work has already been done in that direction?
As a corollary, I would appreciate thoughts any of you have on the value of continuing the effort to develop a MODS ontology? I attended the Semantic Technology Conference recently where I was a speaker in a:
and received quite a bit of interest at the conference, though I met very few from the library community there.
I had hoped to provide something that could:
- Potentially be more universal than our current local approach to
expressing MODS in RDF - Assign class and predicate names in an attempt to make dealing with blank noes and SPARQL queries simpler and more natural, given the (to me) somewhat complicated structure of MODS.
- Provide a formal OWL base for assigning owl:sameAs relationships, alternate rdfs:label values, etc.
However, I am very mindful of (and sympathetic to) thoughts such as the following from Ed Summers, regarding:
“…taking a more organic approach to vocabulary selection, mixing and matching vocabulary elements rather than imposing a particular metadata world-view”
That would make sense to me if there was a generally accepted way to automate the conversion.
Sorry for the somewhat long introductory comment and thanks in advance for any helpful thoughts or suggestions.
Chris Frymann
Digital Library Architect
University of California San Diego Libraries
Email: cfrymann@ucsd.edu
Blog: http://chrisfrymann.com
Reference: An Enterprise Near to Collapse ?
Posted by admin in Editorial, Library World on August 11th, 2009
![]()
The following is an excerpt from a conversation between:
Charlie Stross (CS) – Hugo Award winning author and
Paul Krugman (PK) – Nobel Prize winning economist and columnist for the New York Times
at the 2009 World Science Fiction Convention in Montreal
“…
PK: We still haven’t figured out the economics of easy information dissemination. Even though the Internet is all old hat, we still haven’t seen the economics of it play out. One of the big problems is we don’t know how do people get compensated for producing information when it can be …
CS: This is a personal preoccupation of mine, shall we say.
PK: It’s to some extent mine, although more of one of my employers. The New York Times has got enormous web presence, four million or so people read it online and yield the corporation very little in the way of revenue in the process. Whereas the dwindling number of people who want the dead tree paper are the source of … and the thing survives to some extent because people still like a piece of paper with their breakfast coffee but also to a large extent because you still can’t online get quite the visual quality of color advertisements for luxury goods that you can get in the New York Times Magazine. But you’re relying upon a very thin lag in technology to make the whole enterprise of creating and disseminating information viable. And if that starts to apply to lots of physical goods as well, we’re going to see whole sectors just implode.
CS: Oh, yeah. On the other hand, with physical goods, you’re still going to need mass and energy to assemble the frames. As for the intellectual property, I try not to get too worked up about it. There’s a lot of people angsting about piracy and copying of stuff on the Internet, publishers who are very, very worried about the whole idea of ebook piracy. I like to get a little bit of perspective on it by remembering that back before the Internet came along, we had a very special term for the people who buy a single copy of a book and then allow all their friends to read it for free. We called them librarians.
PK: Which is why … we used to work the professional journals, something I do know something about, professional journals sold about a couple of thousand copies worldwide, at an enormous price because every university library felt it had to have them and still does to some extent, but that’s an enterprise near to collapse because everybody reads the things online now.
Reference: “Triple Stores Aren’t”
Posted by admin in Infrastructure, Tools on August 7th, 2009
…
Is there anything harmful with the misnomerization of “triple”, enough for the community to try their best to start talking about “tuples”? I think there is. Linked Data is the best example of how a focus on the three-ness of triples can fool people into sub-optimal implementations. I heard this fear expressed several times during the conference, although not in those words. More than once, people expressed concern that once data had been extracted via SPARQL and gone into the Linked Data cloud, there was no way to determine where the data had come from, what its provenance was, or whether is could be trusted. He was absolutely correct- if the implementation was such that the raw triple was allowed to separate from its source. If there was a greater understanding of the un-three-ness of real rdf tuplestores, then implementers of linked data would be more careful not to obliterate the id information that could enable trust and provenance. I come away from the conference both excited by Linked Data and worried that the Linked Data promoters seemed to brush-off this concern.”
Reference: “A Reflection on the Structure and Process of the Web of Data”
Posted by admin in Infrastructure on August 7th, 2009

A Reflection on the Structure and Process of the Web of Data
Reference: “Using Linked Data” Diagram
Posted by admin in Linked Data on August 7th, 2009
Reference: “Can Librarians Be Put Directly Onto the Semantic Web?”
Posted by admin in Library World on August 5th, 2009
Can Librarians Be Put Directly Onto the Semantic Web?
…
To some extent, the success of the semantic web in any domain is predicated on the successful embodiment of that domain’s knowledge in ontological code. Either coders need to learn the domain knowledge, or domain experts need to learn to code. People need to talk.”
Conversation with Bruce D’Arcus on Motivation for MODS Ontology
Posted by admin in Library World, Linked Data, Standard on August 5th, 2009
Comment From: Bruce D’Arcus – July 31st, 2009 at 14:18
in response to my previous post:
More on Motivation for Investment in Implementation of a MODS Ontology
It’s semantics are also really loose.
So you have to ask yourself, just how linked could a MODS view in RDF really be?
====================================
from Chris Frymann
to bdarcus@gmail.com
date Mon, Aug 3, 2009 at 1:12 PM
subject Re: Comment on Motivation for MODS Ontology
Thanks for your comment on my blog post:
More on Motivation for Investment in Implementation of a MODS Ontology
http://www.chrisfrymann.com/2009/07/31/more-on-motivation-for-investment-in-implementation-of-a-mods-ontology
I pretty much agree with what you write in your first two sentences:
“The problem from my standpoint is that MODS
has some really odd, library-specific, design choices
that I don’t think map very well to the wider world.
A central concept like mods:name,
with mods:role as a child of that,
really makes no sense, and conflicts with
more common modeling you see in DC, FRBR ,etc.
It’s semantics are also really loose.”
I also think you do ask a very important question:
“So you have to ask yourself, just how linked could a MODS view in
RDF really be?”
As I mentioned in my blog post:
The primary motivation for my work on a MODS ontology is directed toward
identifying an acceptable strategy for migrating existing MARC based metadata
into a form more universally accessible to consumers and producers of
Linked Data.
I’m not really deeply familiar with a lot of the history of community
discussion in this area, so over the weekend I took a look at your
blog, listened to a “Library Geeks” podcast you participated in a few
years ago, and browsed around a few other related links.
I know that such a brief intro can’t do anyone justice, but I think I
have a bit of an idea as to where you’re coming from, and I want to
say first thing that I am not particularly a defender of either MARC
or MODS. I am however a “big” supporter/promoter of concepts relating
to RDF, linked data and the various components of the “Semantic Web
Layer Cake”. Also, though I primarily consider myself a software
developer, and my title is “Digital Library Architect”, I work in a
library where MARC-based cataloging is still the primary form of
metadata creation and MODS/METS are probably the dominant leading edge
standards.
In my two most recent blog posts I tried to provide more detail on my
local situation in which we have implemented a very crude RDF
manifestation of MODS, have entered on the order of 15 million triples
into a triplestore (AllegroGraph), and are successfully using SPARQL
queries to search and process that data. We also feel that we have
developed a pretty nice JavaScript/JSON/AJAX-based user interface
enabling users to discover and view resources.
We would like to share some of this with others, but we also
understand that indeed no one else would be likely to adopt our
particular mode of what ultimately does amount to expressing MARC in
RDF. That is what got me started in thinking that if there really was
a community accepted/acceptable MODS ontology that we could use, it
could make our system more attractive to others.
As it stands logically, I think it would be “more” attractive to
others. However, your question:
“Just how linked could a MODS view in RDF really be?”
is a very good one, and I don’t really know enough to answer that. My
main thought has been simply that a “better” RDF view of MODS than the
one we are currently working with would at least be significant
progress. The serious question you seem to be posing though is that
even with a formal ontology for MODS, that still might not provide
enough new usability for others to consider adopting it as a serious
“standard”.
I only recently learned about your “Bibliographic Ontology
Specification” and though I have not yet had a chance to look at it
deeply, I am impressed with the direction you seem to be heading.
My first basic question though is, What thoughts, if any, do you have
regarding a strategy for migrating MARC to Bibliographic Ontology? I
expect you must have had some discussion in that area already and
would appreciate any references or comments you might send.
Finally, and this is just a fairly small point that I didn’t know
where else to add but here at the end. One of the things I did make a
somewhat feeble attempt to address in my MODS ontology is a little
better approach to handling the problem you mention of “role” being a
child element of “mods:name”. I know it’s not much help, but one
small, although I expect still controversial, thing I suggest is that
the various hierarchically structured MODS elements be given their own
individual predicate identifiers (such as: mods:name.role.roleTerm,
mods:name.role.roleTerm.authority, etc.) which would at least make
such items more directly SPARQL queryable without having to explicitly
build a SPARQL query that mirrors the graph structure of the fact
that terms like “role” may in some cases only be identifiable as a
graph subelement of some parent term like “name”. These identifiers
could also be assigned (owl:sameas) variants in whatever manner seems
most helpful, as ultimately the basic point of a MODS ontology would
largely be (or at least it is for us) to to specify a fixed/standard
set of MODS predicates. This could help to deal with the fact that
different MODS identifiers may have somewhat different meanings
depending on where they appear in the MODS hierarchy.
Sincerely,
- Chris
Chris Frymann
Digital Library Architect
University of California San Diego Libraries
blog: http://chrisfrymann.com
=============================
from Bruce D’Arcus
to Chris Frymann
date Mon, Aug 3, 2009 at 2:56 PM
subject Re: Comment on Motivation for MODS Ontology
On Mon, Aug 3, 2009 at 4:12 PM, Chris Frymann wrote:
…
> As it stands logically, I think it would be “more” attractive to
> others. However, your question:
>
> “Just how linked could a MODS view in RDF really be?”
>
> is a very good one, and I don’t really know enough to answer that. My
> main thought has been simply that a “better” RDF view of MODS than the
> one we are currently working with would at least be significant
> progress. The serious question you seem to be posing though is that
> even with a formal ontology for MODS, that still might not provide
> enough new usability for others to consider adopting it as a serious
> “standard”.
Yes. I don’t think people outside of the library world pay much
attention to library standards, which are often taken to be odd and
baroque.
> I only recently learned about your “Bibliographic Ontology
> Specification” and though I have not yet had a chance to look at it
> deeply, I am impressed with the direction you seem to be heading.
>
> My first basic question though is, What thoughts, if any, do you have
> regarding a strategy for migrating MARC to Bibliographic Ontology? I
> expect you must have had some discussion in that area already and
> would appreciate any references or comments you might send.
I’ve never paid much attention to MARC.But the people at Talis and the
LOC use BIBO, and some of that work has been based on conversion from
MARC. If you’re interested in finding out more, you could always post
a question to the BIBO list?
Bruce
=============================
from Chris Frymann
to Bruce D’Arcus
date Tue, Aug 4, 2009 at 11:33 AM
subject Re: Comment on Motivation for MODS Ontology
Thanks for your reply and suggestion. I’ll try to learn more about
BIBO and how LOC and Talis may be using it.
Again, much of my current focus is on exploring potential paths for
releasing/repurposing the significant body of information that is
essentially locked up in MARC/MODS.
By the way, would you mind if I posted the text of this email
conversation in my blog? It might be of interest to others. Don’t
hesitate to let me know if you prefer not.
- Chris
================================
from Bruce D’Arcus
to Chris Frymann
date Tue, Aug 4, 2009 at 12:29 PM
>
> Thanks for your reply and suggestion. I’ll try to learn more about
> BIBO and how LOC and Talis may be using it.
>
> Again, much of my current focus is on exploring potential paths for
> releasing/repurposing the significant body of information that is
> essentially locked up in MARC/MODS.
Right.
> By the way, would you mind if I posted the text of this email
> conversation in my blog?
No, I don’t mind.
Bruce




Recent Comments