Posts categorized “School”.

University of Michigan Open Access Week

There is a great event coming up at the University of Michigan, sponsored and coordinated by a great team of librarians: Open Access Week 2009.

Molly Kleinman, one of those great librarians, puts it into context for us:

I’m struck by how timely these events are, and how much we could conceivably do under the umbrella of discussing open access and the future of scholarship. … The confluence of circumstances nationally has made this the perfect moment to discuss what’s wrong with existing modes of academic publishing, and to start getting aggressive about making change.

You really should read the rest of Molly’s post for a wonderful explanation of why the current scholarly publishing system is failing for everyone except the Elseviers of the world.

Along with presentations focused on faculty and scholarly publishing models, there is also going to be a talk by my current boss, Nathan Yergler, CTO of Creative Commons. Nathan will be talking about the impact of Creative Commons (CC) licenses on Open Access, what challenges still exist for Open Access, and what the Creative Commons is doing to build and support an ecosystem of openness. Everyone is welcome to join this event, and all the events during Open Access Week. For the details about Nathan’s talk, check out the announcement on the OPEN:Michigan blog.

If you are in the South East Michigan area and are interested in what Michigan is doing to promote Open Access and make it really work, come by for any of the events; there should be a wide enough range to accommodate most interests.

The HathiTrust – A Report for the ALA Office for Information Technology Policy

This past week was Spring Break at the University of Michigan. So I decided to skip the trip to the beach and instead go to Washington DC to work 9-5 for a week. Really.

My school, the School of Information, has this neat program called Alternative Spring Break where students can go work with some really cool organizations in Washington DC, New York, or Chicago. It is an opportunity to go discover if you actually enjoy doing what you are in Graduate School full-time to learn (my words, not theirs). Also, it is a wonderful networking opportunity; I met some really great people last week and whether or not they can help me find a job is secondary.

I specifically worked for the American Library Association’s Office for Information Technology Policy. This is basically the “think tank” for the ALA Washington office. The Washington office also has the people in the Office of Government Relations; the people that go out there and make sure that the libraries’ perspective is heard on Capitol Hill. It is a really important perspective: who else are as big of proponents of open access to knowledge for all people? who else guards your privacy to such a great degree? Librarians are wonderful people to have on your side, but watch out if you do something wrong.

My time at the OITP involved writing a report about the HathiTrust, an endeavor originating at the University of Michigan and the University of Indiana. It is, in the most simple of terms, a long-term digital works preservation project. It is preserving and providing access to all of the digital scans that are being given to the various member Universities from the Google Book Search scannning program and also the libraries’ internal scanning operations. But there are some important implications of the HathiTrust, and that is what I set out to find. I want to give special thanks to John Wilkin, Executive Director of the HathiTrust, for answering my many questions.

If you are curious what the HathiTrust means for you and libraries in general, feel free to read my report: The HathiTrust – A Report for the ALA Office for Information Technology Policy, it is licensed under a Creative Commons Attribution-Share Alike 3.0 Unported License, so feel free to share it with whomever.

Scholarly Publishing and Authenticated Reviews

First, a review of a neat new tool that provides a cool function for many academics:

GPeerReview is a very simple Open Source tool that lets you write a review of a work, embed a hash of the work in your review, and sign that review with your digital signature (using your GPG key). The last two things are pretty neat. The hash allows you to be sure that people know which version of a paper you reviewed. Or at least, they will know if the version they have matches the version you had. This would be useful in the case where major changes are made to the paper that contradict your review.

Then, signing your review so that the author (and their publisher/advisor/dean/what have you) knows it is actually from you is pretty neat, and an obvious use of gpg. In fact, GPeerReview is essentially just a wrapper around the GnuPG command-line tool (see the FAQ).

I think this is a pretty interesting tool that could have some great uses, especially if we integrate it with the work-flow of academics (somehow). Step one of that implementation would be to move it from the CLI to some sort of Word/OpenOffice.org plugin. Or, even better, would be to provide a web-based service for this.

Crazy Idea
Launchpad for Scholarly Articles and GPeerReview

Going back to my crazy idea of a Launchpad for Scholarly Articles: basically a service that provides users the ability to link published articles, whether open access or not, with pre-prints or author deposited versions in Institutional Repositories. The killer feature of this service would be to provide a way for people who DON’T have access to the expensive scholarly journals a way to read and be informed via the pre-prints written by the authors that are not restricted by the overzealous journal publishers.

Then, add on the ability for readers of those articles to make comments on and provide useful reviews of the material. Even adding this ability to places like arxiv.org would be great; it provides a mechanism to build community. And as we all know, the community is what makes any service an important resource for people. Without community the service is just a collection of tools.

But, I’ll be honest with you, I don’t know all of the various web-based services out there for scholarly communication; maybe someone has already implemented something like this. Leave a comment if you know of anything out there like this.

Google Book Settlement

This is old news now since it happened over a week ago, however, the continued discussion of this settlement is needed and hopefully welcomed.

I have been silent on this settlement on this site due to a few reasons (full disclosure):

  • I was at the Open Content Alliance’s (OCA) yearly meeting in the Presidio of San Francisco when the settlement was announced. As such, I was privy to the private discussions between members of the OCA and others. I didn’t want to say anything I learned there before they had a chance to say it themselves.
  • I work with a very high level administrator at the University of Michigan Libraries. The UofM Libraries are one of the Google Book “Fully Participating Libraries” and as such have a special relationship with Google. This relationship may cause members of the UofM libraries opinions’ of this settlement to be influenced in one direction or another.
  • I have a personal moral preference to the methods of the Open Content Alliance and feel that some of Google’s Terms Of Use (in the contracts signed with libraries) are less than good.
  • There have been many people saying contradictory things about this settlement; everyone couldn’t be right in their analysis. Just like sunlight is the best disinfectant, time is the best producer of truth.
  • The settlement is one-hundred and forty-one (141!) pages long. This doesn’t include the fifteen (15!) attachments to the settlement. This is part of why so many were making false claims, they just didn’t get to the part that explained what would happen in the situation they were talking about.
  • Plus, I was going to be giving a presentation on the Google Library Project for my class on Intellectual Property and Information Law (PubPol 688/SI 519). I decided to wait until after the presentation to post my views. I could have posted a draft of my presentation before to see what sorts of comments I would receive but to be honest, I wasn’t thinking that far in the future. Graduate School does that to me.

 

Here is the presentation I gave yesterday (2008-11-7):

(.odp, .pdf, .ppt)
Unfortunately, for you, my slides don’t contain all of the information I conveyed (because that presentation style sucks). Fortunately, for the students in the class, my slides didn’t contain all of the information I conveyed.

You will notice that my presentation takes a very hard look at the Settlement; I’m not one to see something like this and think it is the best outcome we could have had. Yes, there are some really great things to the settlement but that doesn’t mean I can’t critique the parts that are bad.

A quick example of one of the really great things the Settlement provides: All “Fully Participating Libraries,” libraries that have signed scanning agreements with Google and have had a sizable percentage of their libraries scanned, will have free access to the entire corpus of books Google has scanned. Not just the books that were scanned at that specific library, but the books scanned at all libraries. So, if you are a student at the University of Michigan, University of California, Stanford, or any of the libraries listed in Settlement Attachment G “Approved Libraries” you can be happy about that.

If, however, you are a student at any other university or college you won’t be as happy. Your school, unless it pays the subscription fee (not yet disclosed), will only be able to have a limited number of “terminals” that can be connected to the Google Library; a more correct term would be the Google Bookstore. Even the UofM’s own Paul Courant said this settlement will create the “Universal Bookstore;” he didn’t say “Universal Library.” But I digress….

These other libraries will have a set number of virtual terminals based on the size of their school (1 per 10,000 students or 1 per 4,000 students, depending on the type of school). These are virtual terminals because the access is restricted to a physical computer. The number of computers which have access to the service is a set number, but the computers with access could vary based on demand to any computer within the library.

Issues that I didn’t go into depth in my class presentation that are none-the-less important include:

  • The effective monopoly on the materials that Google now has. Sure, others could join the game, at the $145 million price tag, but since this was a settlement not a legal decision there isn’t a lot of incentive for groups such as the OCA to go into talks with the AAP and Authors Guild.
  • To continue my digression from above: the fact that this is going to be a “Universal Bookstore” not a “Universal Library” is slightly saddening.
    • I don’t have a legal reason to feel sad; the copyright holders have every right to charge for these materials. But I feel like everyone other than Google, the authors, and the publishers are being scammed. Again, not for a legal reason, but for a moral reason:
    • Libraries, through public funding, have been keeping these books safe for the last 70 years. These books, up until the day of the settlement, have had worthless to the publishers and authors. These books are out-of-print and thus all purchases of them have been paid to individuals base don the first-sale doctrine. Now, Google, through its Universal Bookstore, will sell you these books and pay the authors for them. Google will not pay the Libraries who were the ones who made this whole endeavor possible. Sure, the libraries agreed to only get the digital copies back as part of their agreements with Google, but that was before anyone had thought about this possibility. Should those contracts be renegotiated?<end_rant>
  • What Happened to Fair Use?
    • This could possibly be one of my biggest critiques of this settlement: the pure fact that there is a settlement. This was a copyright infringement case brought against Google by two associations, the Associate of American Publishers and the Authors Guild. Google had a fairly good Fair Use argument and may have indeed won the case based on it. This would have been a GREAT THING (most likely). Others would have the same rights as Google as it pertains to the scanning and displaying of books.
    • Now, however, Google is a “special citizen” in this arena; they have “rights” others do not. Is that fair? No. Is that was is best for our future, and the future of libraries? No.

 

Hopefully I don’t sound too negative towards this settlement. Ok, lets be honest, I am pretty darn negative towards it. But hey, that is my job, at least what I see my job being. There are plenty of people out there being paid a large sum of money to tell you how good this settlement is. The ones who are out there telling you how bad it is are most likely not being paid to do so; I’m not.

If you have read this far and are still interested in this topic, you should check out what the rest of the world has been saying about this settlement. A good place to start would be TechDirt’s opinion on the matter. And, the Open Access News blog has posts that summarize others’ opinions in four parts (1, 2, 3, and 4).

EDIT:
Full Disclosure (thanks to Jon for reminding me): I am employed by Creative Commons and through that work have been involved with the OpenLibrary Project. Also, I am employed by Paul Courant, the Dean of Libraries for the University of Michigan. As thus, there may been some conflicting influences on my opinions. I am in a special dual position.

Since it hasn’t been talked about enough already…

So, why should LaunchPad (Malone) be open sourced?*

I’m not going to say because other groups need to use the bug tracking/code hosting/question answering/multi-project-resource unifying features. No, I do believe that it wouldn’t make much sense for there to be multiple Launchpads out there dealing with bugs/code/etc (maybe a little of sense, but not much).

That market is already taken by launchpad.net and others (bugzilla, trac, savannah, et. al.)

Ok, so what market am I looking at? Scholarly communication <BORING!>

Not really boring actually. If you haven’t been paying attention to the scholarly communication world lately, let me tell you, a lot is changing. University libraries are spending more and more money every year on electronic journals. The rate of increase for the same product is higher than that of inflation, for a product which doesn’t improve (can we say monopoly/oligopoly?). In response many institutions (university libraries) are beginning to provide competing services. Full disclosure, my current employer is the Scholarly Publishing Office at the University of Michigan where we publish scholarly journals in an online and Open Access fashion. So, we are providing an alternative to the current commercial publisher vendor lock-in.

What does this have to do with LaunchPad and Open Source Software? Well, we are now in a global situation where there are many many many many open access journals and publications out there. There are some services out there than can help you navigate them, like the Directory of Open Access Journals. But, that service only indexes Open Access journals. Plus, there are now these things called Institutional Repositories, which are collections of preprints and articles and data from the “scholars” in a given “institution” (university, research lab, etc).

Then you have the commercial vendors. They don’t like people looking at their stuff, they don’t play nice with others unless they think they will lose money if they don’t. Libraries are getting better and better at letting their patrons search both sets of journals in one place, but the interface ALWAYS is hideous and creates MANY hoops the user has to jump through. In a word, it is LAME.

I haven’t answered what LP has to do with this yet. I’m getting there, I promise.

What does LaunchPad do really well? Linking various bugtrackers so that people can work together more efficiently to solve problems, right? That was the whole goal of Launchpad, otherwise Ubuntu would have stayed with bugzilla. What is the analogue for the scholarly publishing/communication world? You have those many distinct collections of articles (Open Access journals, Institutional Repositories, and commercial vendors) that do not talk to each other, ever. Yes, there are groups out there trying to improve this situation like the Open Archives Initiative where they are setting metadata standards and standards for transferring that information to others. That is a great thing, but it is only a start.

<The Answer, Finally> If we created a LaunchPad for scholarly works, we could solve many of the beginning access issues associated with this crappy situation. Here’s the idea:

Think of a bug, that is the article in this case. The article (bug) can have a published status like draft version or published in a journal (New, Incomplete, Fix Committed). But for it to even be an article in this Scholar’s LP it needs to have a reference to where it is, physically. So instead of a bug originating in LP and then being linked to other trackers as time goes on, the article needs to have an initial link to some place (OA journal, IR, or Comm. Vendor) using some standard like Digital Object Identifier or Handle.net (which assigns a unique id to object online that can point to any address, so the changing of URLs won’t effect findability).

Then, this article (bug) can also have different versions linked to it. So, example: I publish an article in a prestigious journal, Nature, and I’m proud of it. So, I go to the Scholar’s LP and submit a new article. I give it the DOI or handle.net id and it automagically retrieves the metadata from the article’s current place of residence (that is if the provide it, I might have to fill it in myself). Then it shows up as a new article in the system. My advisor, who thinks the work I did was cool, thinks that my previous drafts before publication are also pretty good. Since the version in Nature is not available to everyone for free, he links the preprint version that resides in my University’s Institutional Repository to my article. That is just like linking to an upstream bug in LP.

Of course, all the metadata is editable and updatable with information like author(s), publication data, place, copyright status (license), etc etc. Plus, if we wanted, we could limit certain metadata elements (like copyright status) to only the article’s author(s), we can do that by verifying emails with respect to what is in the actual article’s author list.

This Scholar’s LP could provide a wonderful unified interface so that “scholars” (define that however you want) can navigate this crazy mess of publishing easily (or at least easier). The “killer app” part of this is the ability to link a published article which is under crappy copyright restrictions to other versions which are available for everyone via institutional repositories or other places, in one place.

There are plenty of fancy cool things which could be done with this model, and I will talk about those later. One example is automatically linking to works cited to another Scholar’s LP or to an external link. But for now, I just wanted to get this idea out there and see if anyone has any comments.

* yes, you are right, we don’t need LaunchPad to be opensourced to do this, it was just a way to get you to read this, sorry.

I’m going to D.C.

The school I attend, the University of Michigan’s School of Information, has a neat little program called “Alternative Spring Break.”

Alternative Spring Break, or ASB to the “in people,” is an opportunity for students to work with organizations for a week; basically a mini-internship.  The positions range from usability testing of a website to archive managing at the National Archives to policy oriented positions.

I, being of the policy persuasion, applied for those positions.  A couple with ALA (American Library Association) and one with EPIC (Electronic Privacy Information Center).  The place that took me is EPIC.

So that means I’m using my spring break this year to go work with a group that tries to protect your privacy rights online, fight for a more open government, and make sure you can speak your mind wherever you are.  Yeah, you’re welcome.

Any questions you want me to ask the people at EPIC for you?  let me know.

My daily backup "script"

So, I am having harddrive issues right now. Luckily I have a backup script run daily that keeps everything important backed up on an external harddrive (which seems to be doing ok right now). I also have any documents for grad school on 3 separate harddrives at any one time (I have learned from my mistakes in the past).

This isn’t a blog post about how thorough I am with backups, it is a post of my backup “script” [1] so others can basically copy past (and edit some of the hard coded stuff) and use it for themselves. I was inspired by being asked “what do you use for a backup tool” and listening to the latest episode of LUGRadio where Aq says that this year his resolution is to release more of his quick dirty programs. Things he would feel bad about normally because the user would need to modify some config file for hard coded values. But, in the interest of Freedom and Sharing it is best to let other people take a look at it and get ideas for themselves.

Well, here is my “script” [1]

Hope it can inspire you to backup your stuff every night/week/whatever. Oh, and to do that part, you need to edit your “crontab” by typing: “crontab -e” in your terminal (no quotes). There are tons of guides online on what exactly to put in there, but here is mine:

# m h dom mon dow command
0 23 * * * ./UserLogs/cron_daily.sh

Which basically translates into “Every day at 23:00, run this script”

There you have it! May your data be safe!

[1] Is a pretty looking list of commands in one file a script? That is for the reader to decide

Updates Updates

New to this blogging thing, so I usually forget I even have one, or when I remember and think of a good topic, I don’t write it down and never write it.

Well… Here is a run down of some updates.

1) Ubuntu Michigan LoCo Team is APPROVED! Check out the announcement: We’re Approved! The members of the team are awesome and made this possible.

2) Top story in the NYTimes morning email: Lawmakers Set Deal on Raising Fuel Efficiency
Good right? Well, yeah, in theory. But guess what the goal is…. an average of 35 mpg by… 2020! Wow, way to be innovative there guys! Way to set a high bar. Way to quit being lazy and actually doing something…. oh wait, no. That is a lame goal and you guys ARE just a bunch of lazy bastards. Got it.

3) Kheir was elected as MSA representative! As legal counsel to the candidate all I have to say is, well done.

4) And the craziness of the end of the semester begins… and I thought grad school was going to be easy.