Ruskin on cameras

In the "Who's Reading What" column of The Globe and Mail on Saturday 26 June 2009, Alain de Botton says he is reading The Elements of Drawing by John Ruskin.

The most common response to seeing something beautiful in the world is to take a picture of it or grab the video camera. But this wasn't always the way, and 19th-century art critic John Ruskin, who lived at just the time that people started using cameras, has some fascinating criticisms to make of the camera in a book which every traveller should read, The Elements of Drawing. In his view, taking pictures often prevents people from seeing. Rather than allowing them to see the world more clearly and remember it, it actually distracts them from patient observation.

Ruskin suggested that, when we go travelling, we keep the camera at home and instead resort to two things: drawing and writing. Even if we can't draw at all, Ruskin asked us to have a go, and his book tells us just how we should start to do so.

Drawing something, however badly, always helps you to notice it properly, as does trying to write it down in words. This is a book guaranteed to fix your memories far better than any camera.

I have a two-year-old Canon A430 Powershot digital camera. It's about average as non-SLR digital cameras go, I think, which means it's a technological marvel. But it's not good at capturing anything of large scale. Taking a picture of a glowing orange sunset over a lake or the shades of dark and light green now covering Toronto's ravines is a disappointment.

Part of this is the camera, and different lenses would help. Part of it is me, and more talent would help. But part of it is that the camera just isn't the best way of capturing the scene. A drawing can be far truer.

A relative of mine, a good artist, doesn't take a camera when he travels. He takes sketchbooks, pens and pencils, and a small watercolour kit. He has a shelf of books filled with drawings and paintings from all over the world, and they're far better than any photo album. A building here, an old woman there, a cafe, a hill, a bus, sometimes drawn in haste and sometimes worked on slowly over an afternoon. There's a story behind each one.

I borrowed The Elements of Drawing from the library today. I haven't found where Ruskin says travellers should not take a camera with them but instead write and draw, but I expect to after a closer reading. If I find the exact quote, I'll update this; if not, and in the future you see read this and happen to know where Ruskin says this, please let me know.

Four ways of using Amazon S3

in

I've been thinking about how I might change how I do backups. I set up an account at Amazon Web Services so I could use S3, their online storage system. Since I already have an Amazon account it was trivial to get started.

I'm doing all this on my new Ubuntu machine. Along the way I had to install a bunch of new libraries and packages, and I didn't keep track. I'll mention what I remember. Sorry about that. Oh, and in the examples below I sometimes add some comments with a #. For my examples I use test-file.mp3 and file.jpg, two copies of the same file, which is why they have identical byte sizes.

s3cmd

s3cmd is part of s3tools. I installed it from the s3cmd package and then ran s3cmd --configure to set it up. I didn't tell it to do encryption and it failed a test when I told it to use HTTPS, so I turned that off. It made a .s3cfg file and then I was all set:
$ s3cmd mb s3://wdenton.example
# Two Python warnings skipped in all these quotes
Bucket 'wdenton.example' created
$ s3cmd put test-file.mp3 s3://wdenton.example/
File 'test-file.mp3' stored as s3://wdenton.example/test-file.mp3 (1613952 bytes in 23.6 seconds, 66.84 kB/s) [1 of 1]
$ s3cmd ls
2009-06-28 01:06  s3://wdenton.example
$ s3cmd ls s3://wdenton.example/
Bucket 's3://wdenton.example':
2009-06-28 01:10   1613952   s3://wdenton.example/test-file.mp3
# You can make folders, except they're fake
$ s3cmd put test-file.mp3 s3://wdenton.example/dir/test-file.mp3
File 'test-file.mp3' stored as s3://wdenton.example/dir/test-file.mp3 (1613952 bytes in 19.0 seconds, 82.98 kB/s) [1 of 1]
$ s3cmd ls s3://wdenton.example/
Bucket 's3://wdenton.example':
2009-06-28 02:31   1613952   s3://wdenton.example/dir/test-file.mp3
2009-06-28 01:10   1613952   s3://wdenton.example/test-file.mp3

s3fs

The s3fs project has a page called FuseOverAmazon that explains how to mount an S3 bucket as a file system. (Fuse stands for Filesystem in Userspace.) To get this working I think I installed the libfuse2, libfuse-dev, and fuse-utils packages (or at least they're installed now and everything works), and g++ so I had a C++ compiler. You have to compile this thing by hand, but just run make and move the binary somewhere useful.

There are differences between how s3fs and other things handle "directories" in S3 buckets. In another test where I used s3cmd sync I couldn't see any of the files with s3fs, but in this example I seem to have made it work, sort of.

$ s3fs wdenton.example -o accessKeyId=my_access_key -o secretAccessKey=my_secret_key mnt
$ cd mnt
$ ls -l
# Notice that it doesn't see the dir "directory" I made above
total 1577
---------- 1 root root 1613952 2009-06-27 21:10 test-file.mp3
$ cp ~/file.jpg .
$ ls -l
total 3153
-rw-r--r-- 1 buff buff 1613952 2009-06-27 22:50 file.jpg
---------- 1 root root 1613952 2009-06-27 21:10 test-file.mp3
$ mkdir dir
$ mv file.jpg dir/
$ ls -l
total 1577
drwxr-xr-x 1 buff buff       0 2009-06-27 22:51 dir
---------- 1 root root 1613952 2009-06-27 21:10 test-file.mp3
$ cd dir
$ ls -l
# But after I made dir here, it saw the file I'd put there earlier
total 3153
-rw-r--r-- 1 buff buff 1613952 2009-06-27 22:50 file.jpg
---------- 1 root root 1613952 2009-06-27 22:31 test-file.mp3
$ cd 
$ fusermount -u mnt/

How to s3fs on EC2 Ubuntu has some helpful information.

Duplicity

duplicity gives "encrypted bandwidth-efficient backup using the rsync algorithm." I used rsync and rdiff-backup to synchronize files, and this is the same kind of thing. I installed it from the package.

This environment variable stuff isn't documented in the man page, but it's explained here: Duplicity + Amazon S3 = incremental encrypted remote backup.

$ mkdir test
$ mv file.jpg test-file.mp3 test/
$ export AWS_ACCESS_KEY_ID=my_access_key
$ export AWS_SECRET_ACCESS_KEY=my_secret_key
$ duplicity test/ s3+http://wdenton.example/test/
GnuPG passphrase: [I enter a phrase]
No signatures found, switching to full backup.
Retype passphrase to confirm: [I reenter the phrase]
--------------[ Backup Statistics ]--------------
StartTime 1246158681.12 (Sat Jun 27 23:11:21 2009)
EndTime 1246158681.53 (Sat Jun 27 23:11:21 2009)
ElapsedTime 0.41 (0.41 seconds)
SourceFiles 0
SourceFileSize 3232000 (3.08 MB)
NewFiles 0
NewFileSize 0 (0 bytes)
DeletedFiles 0
ChangedFiles 0
ChangedFileSize 0 (0 bytes)
ChangedDeltaSize 0 (0 bytes)
DeltaEntries 0
RawDeltaSize 3227904 (3.08 MB)
TotalDestinationSizeChange 2956111 (2.82 MB)
Errors 0
-------------------------------------------------
# Let's see what's in the test/ directory up there
$ s3cmd ls s3://wdenton.example/test/
Bucket 's3://wdenton.example':
2009-06-28 03:12     76249   s3://wdenton.example/test/duplicity-full-signatures.2009-06-27T23:11:12-04:00.sigtar.gpg
2009-06-28 03:12       189   s3://wdenton.example/test/duplicity-full.2009-06-27T23:11:12-04:00.manifest.gpg
2009-06-28 03:11   2955922   s3://wdenton.example/test/duplicity-full.2009-06-27T23:11:12-04:00.vol1.difftar.gpg
# Now let's do a test restore
$ cd /tmp
$ duplicity s3+http://wdenton.example/test/ test/
GnuPG passphrase: [I enter the phrase]
$ cd test/
$ ll
total 3168
-rw-r--r-- 1 buff buff 1613952 2009-06-27 22:28 file.jpg
-rw-r--r-- 1 buff buff 1613952 2009-06-27 21:10 test-file.mp3

S3Fox Organizer

S3Fox is the simplest way to get into your S3 space: it's a Firefox extension and lets you drag and drop files. Here's a screenshot:

Screenshot of S3Fox (small version)

Something about the dir/ directory I made with s3fs doesn't work here, but the test/ directory I made with duplicity does. s3fs doesn't see the test/ directory. It seems like if you go with s3fs you pretty much have to stick with it, but these other tools are work nicely together.

Conclusion

If I do start using this for backups, I'll use Duplicity and do manual stuff with s3cmd and S3Fox.

branchhours

I set up a Github repository for branchhours, a short Ruby script I wrote to help show library branch opening hours on the home page. I set up a bunch of Google calendars, one for each branch, and use them to hold the hours the branch is open or an indicator it's closed. It's not in production yet, but it seems to work OK as it is, so I thought I'd mention it. I'll post more about it when it goes into use.

If the idea of using Google Calendar to show library branch opening hours on a home page seems familiar, perhaps that's because you read Using Google Calendar to Manage Library Website Hours by Andrew Darby (Code4Lib Journal #2). It's a simpler variation on what he did, in Ruby instead of PHP and without using a database or showing reference librarian information.

At Their Fingertips

My colleague Sarah Coysh, e-learning librarian at York University Libraries, and I gave a talk at the 2009 Canadian Library Association Conference on 1 June in Montreal: At Their Fingertips: Customized Library Resources in Course Management Systems and Student Portals. Slides (with notes) and audio (I put my MP3 player on the lectern) are available. If you're interested in seeing screenshots of another university's Moodle and student portal then have a look.

Video of What We Talk About When We Talk About FRBR

in

In late February Jodi Schneider and I did a talk at Code4Lib 2009, and I posted the slides on The FRBR Blog: What We Talk About When We Talk About FRBR.

The video of the talk is now online!

Thanks to Talis and Karen Schneider and the Code4Lib organizers and Brown University for doing all the work.

UPDATE 22 June 2009: The video is also up at the Internet Archive, along with the rest of the C4L 2009 talks.

Stoic podcasts and interviews

"It would perhaps increase my acquaintance, the thing which I chiefly study to decline."

in

Quote from a letter from Sir Isaac Newton to one Collins, 18 February 1669/70:

The solution of the annuity problem, if it will be of any use, you have my leave to insert it into the Philosophical Transactions, so it be without my name to it. For I see not what there is desirable in public esteem, were I able to acquire and maintain it. It would perhaps increase my acquaintance, the thing which I chiefly study to decline.

Found when I flipped through Correspondence of Scientific Men of the Seventeenth Century, edited by Stephen Jordan Rigaud (with an index by Augustus de Morgan!).

Footnotes in The Spellman Files

I added to Lisa Lutz's The Spellman Files to the list of fictional footnotes. You can download the footnotes:

For those of you on the Kindle, or eReaders, you'll be happy to know that we're now making available the footnotes for all the books in a downloadable PDF (which you can use for quick reference): Download Footnotes for The Spellman Files. [PDF]

Are the footnotes not available in e-book versions of the novel? Is this generally true? I've never seen a Kindle, much less examined how it handles footnotes.

isbn2marc is back

My Ruby script isbn2marc is back. I thought I needed to use the advice in Configuring .htaccess to ignore specific subfolders but it turned out the Options Indexes line in /src/.htaccess was choking things and causing permissions errors. Beats me why, but at least it's fixed. Sorry about the disruption.

Code4Lib 2009 notes

I went to Code4Lib2009 in late February and had a fantastic time, not just because Jodi Schneider and I gave a talk called What We Talk About When We Talk About FRBR. The whole conference was a blast and it was great to hang out with people I knew and meet new people.

Ed Summers posted his conference notes, Jonathan Rochkind doesn't like the Code4Lib award idea Eric Morgan put forward, Karen Coombs posted notes, Jay Luker did a cool IRC channel timeline, Terry Reese did ten things to take away from it, Jon Phipps wasn't enchanted with it all.

Here are my notes, pretty much raw. I didn't take notes on the linked data pre-conference, the morning of which was excellent. The afternoon got a little too loose and I got distracted with some annoying Ruby on Rails problem.

Monday 24 February 2009

Roy Tennant: Introduction

Mark Matienzo: How to Meet People and Have Fun at Code4LibCon

Stefano Mazzocchi: A Bookless Future for the Libraries?

TODO: Listen to Jon Udell interview

Interesting talk. Began by talking about how marginal costs of communication have been dropping over history, from cave drawings to clay tablets to the printing press to electronic publishing.

Pros to this, but cons now, too, like a "degraded consumption experience:" low screen resolution, batteries required, poor network access.

Business models are being disrupted. Institutions like libraries are being disrupted. Almost-zero marginal costs are here to stay.

Libraries vs museums of books. Non-unique books can all go online in electronic versions. (Unique books still require special attention.)

No more shelves. Nearly infinite storage space. (He's getting into a lot of "the book is dead" stuff---tide turning against him in IRC.)

Do we still need metadata? (Shouts of "yes!"). Does metadata have to be made by people or can it be done by computers with statistical analysis?

Information is fragmented. Can the library mindset still work across a spectrum of such fragmented information, hyperlinks, journals, networks? Networks of relational assertions = the web of data.

He asked a few questions ("who's the youngest 2008 Academy Award winner") that were basically trivia questions. Where would we find the answers? His answer: Freebase. He found the answer to that question easily enough, but his talk ran into trouble on the next one when he tried to show how Freebase made the answer easy to find and the whole thing failed due to network slowness. He lost some of the audience here.

Showed FMDB, a granular linked-data version of IMDB data. Showed a translator. Showed http://typewriter.freebaseapps.com/, where they crowdsource specifying unspecified data. Showed the Genderizer (!?).

[A few days later Mazzocchi posted Post-Mortem of a Dissonant Keynote: he went into the #code4lib channel logs to see what people were saying while he was taking. That took guts. Great post.]

Anders Soderback: Why Libararies Should Embrace Linked Data

[I talked to Anders for a while after this. Very nice fellow, and Libris is a great piece of work.]

Libris, the Swedish national union catalogue.

Less technical version of the talk he gave the day before. Martin Malmsten was supposed to be here but couldn't make it. He talked about the web, that it's social, that it's a network, etc. He spent a while getting to the actual catalogue. When he did, he showed the basics of it: the RDF version, the frbr-related, different representations, etc.

Their blog: http://blog.libris.kb.se/semweb/

Ross Singer, Like a Can Opener Through Your Data Silo: Simple Access Throgh AtomPub and Jangle

http://jangle.org/

Problems with library systems APIs: not very good. OAI-PMH etc. Atom + REST = AtomPub, an Atom publishing model. Ed Summers pointed out a couple of years ago how well this would work. Google, Microsoft, IBM, WordPress, Drupal, Movable Type, etc., all use it.

Jangle applies a common data model to library information through AtomPub. Four kinds of entities: resources, items, actors (users etc.), collections.

Jangle vocabulary: http://jangle.org/vocab/

Ross is building connectors to various systems so that it doesn't matter what kind of ILS you're using (ideally) you can always talk to it with Jangle.

Godmar Back complimented the idea and the importance of interoperability.

Glen Newton, LuSQL: (Quickly and Easily) Getting Your Data from your DBMS into Lucene

Glen's from CISTI in the Digital Library Research Group. They have 8.5 million full-text articles and want to index them all to make them easy to search. They use Lucene but not Solr, they use LuSql. He showed how to do some queries with LuSQL, talked about how the indexing was done by Lucene, showed some output.

Terence Ingram, RESTafarian-ism at the National Library of Australia

My attention was caught when he said they used VuFind and asked how many people here used. Laughs when Andrew Nagy stuck up his hand. Ingram said they just needed something so had a quick look around and chose VuFind. Very casual. All those people who look to the NLA as a guide in their own choice of VuFind will be surprised!

Birkin James Diana, The Dashboard Initiative

http://library.brown.edu/dashboard/info/

Showed some widgets that make a dashboard for showing all of the activity going on at a library, with widgets for circ, ILL activity, other kinds of things. Has some nice visualizations in there using some kind of Google visualizer. People in channel also mentioned Flot. The code is available for download. Open source!

Lunch

Ed Summers and Michael Giarlo, Open Up Your Repository with a SWORD

http://swordapp.org/

"SWORD is a lightweight protocol for depositing content from one location to another. It stands for Simple Web-service Offering Repository Deposit and is a profile of the Atom Publishing Protocol (known as APP or ATOMPUB)." Lets you deposit into repositories without worrying exactly what kind of repository it is: Fedora, DSpace, whatever. Ed explained it and Mike showed some examples. Looks easy to use and very useful if you need to do what it does.

Mark Matienzo, How I Failed to Present on Using DVCS for Archival Metadata

He proposed to talk about how DVCS would work for archival metadata but found it was too complicated. Slide: "I failed, epically." He looked at bzr and git but then went with Mercurial. Problem: It works on line diffs (for code) but EAD is in XML. How do you diff the XML? There are various tools, but in the end the whole thing turned into a complicated mess.

Godmar Back, LibX 2.0

LibX

Summarized LibX 1.0. But a toolbar is great, though what about emerging technology trends (mashups, SOA) and educational trends (online tutorials, social tagging, visualizations)?

In 2.0, librarians can create Libapps (?) and users can add them into pages where they want. LibX runs/houses them. Libapps are made from reusable modules.

How to make a libapp? Modules is JS plus metadata description, a Libapp is a group of modules, a Package is a folder of Libapps.

Modules: Named at a URL, published with AtomPub. They use tuples in JSON. { isbn: "074322670" }

Example: user goes to ACM site. There's a LibX button. Click on it and a YouTube video shows up, with person giving some help! E.g. Annette Bailey.

He showed the code to implement that, just a bit of JS. Simple. Some other bits of code needed, but all pretty easy.

LibX 2.0: been rewritten in full browser-independent OO. about:libx gives built-in documentation. They have unit tests and it's all hot updatable (?).

Roles in the LibX 2.0 world: developers, adapters, user community.

LibX community repository will be built, to hold modules, packages, etc. Coming over the next two years.

Gradual transition from LibX 1.5, not a huge major release.

Kevin Clarke and John Fereira, Djatoka for Djummies

Kevin gave an overview of Djatoka, then John did some demos.

Could we use this for the big maps that in Maps? Instead of Zoomify? Lots of overhead, though---runs on Java, requires Tomcat, etc. But worth a look.

Breakout sessions

LibX 2.0, Zope3/Grok/Plone, Fedora, Solr, Jangle. I went upstairs and napped.

Lightning Talks

David Lindahl, XC

Quick overview of XC, but he lost me by showing a couple of short cartoons while was talking, which distracted me.

Casey Bisson, Scriblio

The internal data model is improved and now you can do original cataloguing in it. It's a digital library out of the box now, too, and people are using it for that. Some kind of tie-in to LibraryThing's Common Knowledge, too.

Mark Matienzo, enjoysthin.gs

Showed this new social bookmarking site.

Emily Lynema, E-Matrix

ERM she's working on at NCSU. FRBRy data model. E-resources can be "narrowly related" or "core" to a subject/discipline. Interesting! Gradations of relevancy to a program or subject. List them on subject guides.

Eric Lease Morgan, Alex4

http://infomotions.com/sandbox/alex4/

Goal: facilitate a person's ongoing liberal arts education.

Geoffrey Bilder, "Cool URIs Must Die."

He's from CrossRef. They're fighting linkrot. "Persistence isn't a technical issue, it's a social issue."

John Law, Summon

http://www.serialssolutions.com/summon/. Searches everything a library has: physical, virtual, etc. One search box.

Erik Hatcher, LucidFind

http://www.lucidimagination.com/search/

They're searching everything on lucene.apache.org: mailing lists, bugs, etc.

Mike Taylor (Index Data), Making Distributed Configuration Simple with the Torus

Metasearch engine: http://indexdata.com/pazpar2/

They've done a way of simply specifying exactly where you want to search and what fields you want to show in the results. Translucent Record Store = Torus

Jakub Skoczen, also from Index Data, on how Torus is implemented

Michael Klein and Jonathan Brinley on zoia's FOAF support

Fun!

Andy Ashton, Biblio

He's at the Scholarly Technology Group at Brown. One of many projects called Biblio. Biblography project.

Naomi Dushay, VuFind at Stanford

http://searchworks.stanford.edu/

Related results. Showed FROGS OF AUSTRALIA page. OK. LITTLE BEAR, good. ASSAULT WITH A DEADLY DONUT: bad!

searchworks-test.stanford.edu: They did a shelf-browse across all of the libraries, including storage. They show it in a right nav, which is a good idea. Should use this.

Mike Beccaria, Zoom Zoom Zoom

Paul Smith's College: paulsmiths.edu They've scanned in old yearbooks and put them in ContentDM. Didn't like it.

Showed Microsoft's Deep Zoom. Looked cool. Worth checking---for Maps? He put a whole yearbook into one image and then put it into the system, so you can zoom from a full overview right into something very small.

Photosynth: took pictures of the stacks and showed how you can move around them and zoom in on spines. Could link to the catalogue?

Dan Chudnov, BagIt

http://digitalpreservation.gov/ uses BagIt.

Wednesday 25 February 2009

Sebastian Hammer, Index Data

Talked about Index Data and their history. Got into books and libraries in general. Whithr libraries, with Google? They're digitizing things better than we are. Libraries could be swallowed by whipper-snapper technology.

Why (Local) Libraries: bearers and presevers of cultural heritage; conveyers of authoritative information; supporters of learning and research; pillars of democracy

"Even if we end up dying, we can't go quietly."

Libraries need to stay local but also come together: consortia, a group of libraries working together on web sites, turn into a "super-robot."

Quoted Lorcan Dempsey on "stitching costs." Most so-called APIs are loyalty schemes rather than interoperability devices. Standardization is hard/boring but essential for collaboration. Need more collaboration and working together, while keeping our business models. Systems and organizations need to surrender our data freely. Library hackers must become adocates within their orgs.

"MARC is a joke. Z39.50 is for old people. SRU is dumb. NCIP ... forget about it."

Tim McGreary, OLE Project: A New Frontier

OLE is "working to redefine the business processes for libraries."

They want to be "flexible, adaptable, community-developed," they want "improvement beyond the ILS." Format and resource agnostic. No need for a separate ERM. This is not just an ILS replacement: special collections, video, DRM, everything. They want a service-orientd architecture.

Community-based software development and governance. Raise the Library system to the enterprise level. Complement human interaction.

He reviewed the project timeline. Ah, project timelines.

They're using the National Library of Australia Services Framework. (What is this?)

In March: OLE core services will be defined. "Final design document" to be published in July, and then another group will do stuff.

Bess Sadler, Blacklight

http://blacklightopac.org/, http://blacklightrubyforge.org/

Give up on the idea that we can do a single interface that will work well for everyone. Different kinds of students have different needs.

Top question at music ref desk: I play violin, my boyfriend plays piano. What can we play together? Their old system wasn't indexing this information. Not everyone would want to search for that, but music students would. Should let them do it. Another q: can I find things by era?

They have a music portal: new books on home page are all music books.

Special behaviour for special objects: any musical recording, they go to Musicbrainz, get the unique ID, then go to other services to get metadata that key on that ID.

"College students are broke and they like music. If we are spending lots of money on music and not letting them know about it, that's a fail."

Blacklight runs on Rails. Uses Rails Engines so it's easier to keep up with the code base but make local changes.

TODO: marc4j is now much better ... look it up.

TODO: Install Blacklight and try it out. Look at Rails code.

Joshua Ferraro, biblios.net

He's from LibLime. Explained about their business. Open Library, Open Data Commons License.

Look at: extJS

"biblios.net is LibLime's free browser-based cataloguing service." 35 million freely-licensed records.

TODO: Make OpenFRBR query biblios.net (requires biblios.net account)

Look at: bws.biblios.net: APIs that let you interact with the database

He downloaded a MARCXML record from Blacklight and loaded it up into Biblios, using curl at the command line.

They're working with ONIX records, too, but keeping them separate from MARC because they're made with different rules.

Authority files are available. Useful?

This is a great project. York should be uploading everything there.

Chris Catalfa, Adding Functionality to Biblios with CouchDB

UI: extjs.com + jquery.org

1. Implementing an editor for Dublin Core. He showed a bit of XML that constructs an editor. (Faint code, couldn't read.)

2. Then they use CouchDB on the backend.

He was showing a lot of code snippets and a lot of unreadable XML.

Toke Eskildsen, Complete Faceting

Summa

Lunch

Erik Hatcher, The Rising Sun: Making the Most of Solr Power

10.
9. Performance
8. Memory
7. Query parser: default is to show the raw query parser
6. Data import: bring data into Solr with Data Import Handler, Solr Cell, CSV, LuSQL, APIs.
5. Request handlers: leverage Solr's configurability
4. Solr and IR toolkit
3. LocalSolr
2. Faceting: can do multi-select faceting
1. The interface is the app. (Solritas, Velocity)
0. Community

Lucid Imagination: they do support and training for Solr.

TODO: Look for screencast Erik Hatcher did showing off Solr and how to get it working.

Chris Shoemaker, FreeCite: An Open Source Free-text Citation Parser

http://freecite.library.brown.edu/

API URL: POST to http://freecite.library.brown.edu/citations/create

JSON responses implemented today. Give FreeCite an unformatted citation and it will give you back a formatted citation. Nice piece of work ... except it gives back almost-OpenURL?

Richard Wallis, Squeezing More from the OPAC

People may like facets and relevance, but they also want links to Amazon and Google Books and other places outside the catalogue.

JUICE: Showed injecting Javascript into pages on the fly, in the browser, so that when looking at something in a catalogue, it added links to WorldCat/Google Books/LibraryThing/Open Library/etc., either as text or as images.

liblists.sussex.ac.uk/lists/v3029.html

Inheriting CSS of the page means things look the same.

Just a matter of pasting in a bit of Javascript, built on a template, looks pretty easy to build.

Sean Hannan, Freebasing for Fun and Profit

Intro to Freebase. They slurp up info from Wikipedia, Musicbrainz, etc., plus people can also edit it.

REST API: http://api.freebase.com/

Example: embedddable snippet of code that links Academy Awards data to entries in own catalogue. Hmm! I could use this.

Example code: http://code4lib.mrdys.user.dev.freebaseapps.com/

Lightning Talks

Lots of good talks here. I was a bit distracted through some of them because I was getting people to sign the release forms and was keeping trak of who had signed what.

Chris Fitzpatrick: thinkbase? Check it out.

Thursday 26 February 2009

Ian Davis, If You Love Something ... Set It Free

The Semantic Web has fundamentally changed how people, computers and information interact, not because of the semantics, but because of the web and how easy it now is to connect information.

Conjecture 1: Data outlasts code. Therefore open data is more important than open source.

Conjecture 2: There is more structured data in the orld than unstructured.

Conjecture 3: Most of the value in our data will be unexpected and unintended. Therefore we should engineer for serendipity.

"The goal is not to build a web of data. The goal is to enrich lives through access to information."

"Technology grows exponentially, but society adapts linearly."

Warning: We should exchange and share information, but we must be careful that there are people involved, and some data should be kept private and protected. But most of it doesn't need to be.

Ed Corrado, The Open Platform Strategy: What It Means for Library Developers

Marshall Breeding: TOC of open source is roughly equivalent to commercial. But a big ARL found that open source software would costs 30% (?) more than commercial over a few years (cite: Grant?).

Talked about Ex Libris's Open Platform program.

Adam Soroka, A Modern Open Webservice-Based GIS Infrastructure

Chris Beer and Courtney Michael, Visualizing Media Archives

Cool demo built on the conference FOAF data: http://ratherinsane.com/~chris/c4l09/index2.php

Lightning Talks

Christopher Morgan, http://bookgenius.org/beta

Shows visual structure of subject headings. Try "evolution." He likes Thomas Mann's book on library research and emphasis on use of LCSH.

Ross Shanley-Roberts, Extracting Data from III with Expect

Rosalyn Metz

Showed how easy it is to set up a server in Amazon's EC2. Really only takes a few minutes! Not cheap, though. Great talk!

Heikki Levanto

Another Index Data guy talking about their stuff. This guy talked about regexes and parsing web pages.

Mikkel Erlandsen, Summa

Jonathan Rochkind, Umlaut

Richard Wallis, Open Catalogue Crawling Protocol

Xiaoming Liu, checking links and URIs

TODO: look at Firefox plugins: LinkChecker, LinkEvaluator.

TODO: look at command-line: linkchecker

Syndicate content