Are your words smart enough?

it's love

Today, at the O’Reilly Tools of Change conference, we’ll be announcing an initiative to create a new standard for getting and publishing information about words.

We’re calling it “smartwords”, and it will be an open standard — meaning anyone can publish data sets or develop applications using it. Smartwords will be context-aware and real-time … but also lightweight, easy-to-use, and versatile. We’re developing this standard with help from our first smartwords partners, including The New York Times, Forbes, The Huffington Post, O’Reilly, Vook, Scribd, ibis reader, and the Internet Archive.

With smartwords, you’ll be able to access not just traditional “dictionary-style” information, but also metadata, such as how frequently a word is used, where words are used, and who uses particular words. You’ll also be able to publish information about words — if you create a word, you can put a flag in the ground and claim it for your own — and smartwords will enable cool social features, like sharing and tagging.

What would a world with smarter words look like?

— You’re reading a new popular-science bestseller and your reader shows you quick definitions of the most difficult words, set right in the text … based on knowing what books you’ve already read and what words you’ve already seen!

— You’re a consumer and you have a few sources you trust for information (like, say, the New York Times). When you’re reading something from a different source, you can set your ereader to highlight what you’re reading to link you to good definitions (or similar content) in your trusted sources. (Instant fact-check!)

— You’re reading a great new novel and you see a great quote you’d like to pass along — you highlight it and share it on Facebook or Twitter.

The question is: if every word became a smart word, what would you ask it and what would it tell you?

We’ll be releasing version 1 of the smartwords standard in Summer 2010. With this new standard, we should able to do fantastic things with smartwords — and we want to hear from you about the kinds of information you would like to access and the kinds of applications you would like to build. Visit us at smartwords.wordnik.com to learn more!

(There’s more information in this nice writeup about smartwords from the Wall Street Journal’s Digits blog.)

A random walk through Wordnik

Quick note: we’re moving the word-of-the-day and the list-of-the-day off the Wordnik blog. Soon they’ll be available via email (visit your profile page to sign up), but for now you can find them (and us!) on Facebook and Twitter. The word-of-the-day also has its own page.


Sometimes I am asked “How am I supposed to use Wordnik?” and I’m not sure what to answer. “However you want,” seems a little too passive, and “to find out how words work” seems too obvious. So I thought I’d write up a recent walk I took through Wordnik.

I started with Random Word, because, hey, who doesn’t like random? The first word I got was run-a-ball, which is some kind of cricket term. I know nothing about the game of cricket (and in fact have gone on record to speculate that cricket may be an elaborate hoax). I tagged it “cricket” anyway, despite my lack of knowledge of the game, because the sentences were pretty clear (and I can check it with Kumanan and Krishna, our resident Wordnik cricket aficionados, on Monday). After I tag something, I usually check to see what else has been tagged with the same tag, but again, I’m not that interested in cricket, so I decide to look at the cricket entry, instead.

The entry for cricket is pretty good (and the pictures are nicely divided between the bug and the game, and somebody’s adorable kitten and his toy cricket — the rule of Flickr images, given any fairly common word, you will find at least one picture of a cat at that word).

It’s in the related words that I had my first “ah-ha!” moment — one of the synonyms given for “cricket” is “stool.” Huh? I clicked through to the full Century Dictionary definitions set, and I’ll be damned, there it is:

A small, low stool; a footstool. A barrister is described [Autobiography of Roger North, p. 92] as “putting cases and mooting with the students that sat on and before the crickets.” This was circa 1680. N. and Q., 7th ser., IV. 224.

(“N. and Q.” is “Notes and Queries” and that quote is from an 1887 issue.) Then I got curious as to who Roger North was, and found out that he looks like this, and that his letters

give us a delightful picture of the private life of a man of high birth, great abilities, and extraordinary accomplishments, who, after a successful career at the Bar, retired in the prime of life to his country house, and devoted himself to improving his property and exercising an enlightening and elevating influence upon his tenantry and neighbours, while he continued to take a lively interest in all that was going on outside the immediate range of his daily occupations.

(I put his book on my “read someday” list, which list, at the rate it is growing, requires me to live to be 137 years old.)

Going back to Wordnik, I wondered (not being a connoisseur of stools) how many other stools there might be that I knew nothing about, so I clicked through to the main entry for stool and scrolled down to its related words — to find, among others, cutty-stool, whose second definition is:

A seat in old Scottish churches in which acknowledged female offenders against chastity were placed during three Sundays, and publicly rebuked by their minister.

And on that same page there was a Twitter account I didn’t know about — someone else besides DrSamuelJOHNSON is tweeting in the style of Samuel Johnson (if Samuel Johnson were a) alive and b) not inclined to think Twitter a snare and a diversion), 1755Dictionary. Interesting!

So that’s one way to use Wordnik — certainly not the only way, or the most efficient way! — but a fun one.

If you have taken a particularly interesting walk through Wordnik lately, feel free to tell us about it in the comments!

Wordnik & Wordie: Moving Day!

So we’ve unpacked all the Wordie boxes and all the data is moved in and put away nicely, so everything should be ready for Wordniks and Wordies to live together in peace and harmony.

If you’re a Wordie, in the vast majority of the cases your Wordnik username is the same as your Wordie username — to start using Wordnik, all you have to do is reset your password with this link. All your tags, lists, comments, and coolness will magically show up on Wordnik under your username.

A few Wordies and Wordniks had the same usernames — great minds think alike — so we staged a few contests of luck and skill to determine which person grabbed the prize. If your username was affected, you’ll get an email soon with your new username and a link to reset your password. (Again, all your data will show up on Wordnik under your new username.)

If we were pretty sure you were the same person on Wordie and Wordnik (same username, same or suspiciously similar email addresses) we merged the accounts. (Log in with your Wordnik password.) If it was merely a case of mistaken identity and you are now living with a stranger, let us know ASAP!

This was a BIG undertaking (kind of like docking the space shuttle, only without the triumphant classical-music soundtrack), so if you notice any wonkiness, please let us know. We’ll be doing a bit more polishing and dusting over the next couple of days to make extra-double-sure that everything is where it’s supposed to be and does what it is supposed to do!

Announcing the new Wordnik alpha APIs! (UPDATED)

Today we’re happy to announce the alpha version of our new Wordnik APIs! UPDATE: See the video of the announcement we made at the Web 2.0 Summit.

Wordnik’s goal is not just to collect at least some information about every word in English — it’s also to make great information about words widely available, and our alpha APIs are a first step towards that goal.

Our new APIs include:

  • a definitions server, with definitions from The Century Dictionary (other dictionaries will be coming soon);
  • a “frequency” API, which returns a frequency number based on our initial API corpus*;
  • an “examples” API, which will return up to five example sentences for any word that appears in our initial API corpus;
  • the Wordnik word-of-the-day API (so you can create your own word-of-the-day wrapper or widget);
  • and it’s not really a standalone API, but we’re also throwing in an autocomplete API that is useful for making stuff with the other APIs.

You can sign up for our APIs here. Depending on demand, we may have to stagger approvals so as not to overwhelm the servers. If you want a better chance of being approved, give us as much detail as possible about how you plan to use our APIs. Coolness counts (but spelling doesn’t — since we haven’t released a spelling API yet).

Rudimentary documentation is here.

This is just a start — we’re hoping to release new APIs at regular intervals, so if there’s a kind of word data you’re longing to have access to, please let us know!

(* Our initial API corpus is about 3 billion words of running text. The API corpus is slightly different from the corpus that drives the Wordnik web site.)

Our New Look

You may have noticed a new look at Wordnik this morning … just in time for Dictionary Day!

our new logo!

We’re hugely excited about this update, which includes some fun things (like the Random Word link and our new Zeitgeist page).

One other important new thing: logged-in Wordniks now have profile pages. By default, they’re all set to “private,” but if you would like to share your browsing history and favorite words with the world, you can set your profile to “public.”

We’ve made a lot of changes in a very short time, so please do let us know (at feedback@wordnik.com) if you see something broken or missing. We’d really appreciate it!

We’ll be changing (well, adding, mostly) more new stuff in the coming days and weeks, so stay tuned!