It’s just data

Don't Panic

“Just” use HTTP.

This is an updated version of my Attractive Nuisance at Chris Sell’s  Applied XML Developers Conference 5.

Please be sure to read these disclaimers.  And for a good measure, it wouldn’t hurt to read these reviews of my prior talk.


Don’t Panic

MSIE doesn’t handle ', I wonder if the MSN feed uses the same library (instead of being a multiple encoding problem).

Posted by ed costello at

Super Geek Slide of the Year

If you ‘get’ this slide, you maybe need to get a life. (An excellent presentation from Sam Ruby — 125 slides of pure headache material.)......

Excerpt from Stefan Tilkov's Random Stuff at

Sam Ruby: Don't Panic

Wayne Burkett : Sam Ruby: Don’t Panic - ‘"Just" use HTTP.’ Yeah, right. Paul Hammond : Sam Ruby: Don’t Panic - ?Just? use HTTP....

Excerpt from HotLinks - Level 1 at

etech day two

some high order bits from the morning danny hillis of applied minds demoed some of their mindblowing work, much of it in robotics and hardware, but really in every conceivable technological field. imagine a plasma screen tv oriented like the surface...

Excerpt from justinsomnia at

eTech05: "Just" Use HTTP

Sam Ruby talks about why we can’t “just use http”. Ouch. My head hurts. This is important stuff, and I......

Excerpt from All the Pages are My Days at

Don’t Panic

So when you are going to give us the solution? ;)

Seriously though - what are the alternatives? Reinvent XML? Only support a fraction of XML?

Posted by Dan Diephouse at

Just Use HTTP

Sam is talking about using HTTP. He reference’s Tim Bray’s WS-Pagecount article on complexity. The assumption is that HTTP is simple. Its not. There are lots of things to watch out for. Starting with identity: when is “a” an......

Excerpt from Phil Windley's Technometria at

DevCon: ETCON 2005

[link]...

Excerpt from del.icio.us/yohei at

DevCon: ETCON 2005 Sam Ruby's talk

DevCon: ETCON 2005 Sam Ruby’s talk...

Excerpt from Leon's first blog at

Don’t Panic

The default encoding for XML over HTTP can be US-ASCII. (Depends on the MIME type.)

Posted by Anne at

Sam Ruby: Don't Panic

“Just” use HTTP....

Excerpt from paulhammond links at

Sam Ruby's ETCON 2005 presentation

Sam Ruby’s ETCON 2005 presentation made my head hurt. I did the Laura Ingraham groan a few times. What makes this all worse is that almost everything he discusses has a reasonable rationale, but it all combines into a big mess. Via Sam Ruby....

Excerpt from Keith's Weblog at

?Just? use HTTP

Anne van Kesteren : ?Just? use HTTP - An update to his earlier presentation....

Excerpt from HotLinks - Level 1 at

Sam Ruby: AJAX Considered Harmful

Sam Ruby has done more careful and detailed analysis of the issues associated with http writes AJAX Considered Harmful.........

Excerpt from 42 at

Feedtagger

Feedtagger is an online aggregator, currently early prototype but already showing a great deal of promise. As well as a regular per-feed view, it supports tagging. Remember folksonomies, bit of a craze a week or two ago? That kind of thing. . The...

Excerpt from Planet RDF at

Encoding & Layering Fun

I saw an interesting talk by Sam Ruby which was ostensibly about not using HTTP for things which it was not intended. The good stuff starts around slide 33. The thing that I found most interesting was his discussion about......

Excerpt from dmose's blog at

“Just” Use HTTP

I heard that the “Just” Use HTTP session by Sam Ruby was good (it was the first day of ETech, which I missed). Not to fear! The slides for the session are online. They talk about a bunch of issues that go in and around trying to use HTTP - character...

Excerpt from Bitsplitter Blog at

Sam Ruby: Don't Panic “Just” use HTTP."

[link]...

Excerpt from del.icio.us/FalsePositives at

ETech 2005 Trip Report: "Just" Use HTTP

These are my notes from the “Just” Use HTTP session by Sam Ruby The slides for this presentation are available. No summary can do proper justice to this presentation so I’d suggest viewing the slides. Sam’s talk focuses on the various...

Excerpt from Dare Obasanjo's WebLog at

New slide for Sam Ruby

Here you go: This would not register with any other string... details took a trip through HTML forms, email response and finally the app textbox.......

Excerpt from Bill de hÓra at

Three days worth

Forty five things I wanted to write about, just over the last three days....

Excerpt from phil ringnalda dot com at

ETECH05 Summarizing

I was only ultra-briefly passing through, so I’m depending on the intarweb to try to sift through the enormous amount of good stuff. O’Reilly ETECH05 Wiki - surprisingly uncomprehensive coverage - comparing the % of usefulness of sources requiring...

Excerpt from random($foo) at

ETech 2005 Trip Report: "Just" Use HTTP

These are my notes from the “Just” Use HTTP session by Sam Ruby The slides for this presentation are available. No summary can do proper justice to this presentation so I’d suggest viewing the slides. Sam’s talk focuses on the various...

Excerpt from Dare Obasanjo aka Carnage4Life at

Don’t Panic

hi

Posted by huynh duy at

untitled

Of the talks that I attended yesterday, two stood out as particularly interesting. The first was about the launch of the new Google AdWords API. This is a new API meant for use by people who advertise on the Google site (those ads on the right side...

Excerpt from EmergingTechReport at

DevCon: Encoding

[link]...

Excerpt from del.icio.us/rainer at

DevCon: Yahoo! and REST

[link]...

Excerpt from del.icio.us/rainer at

DevCon: Fundamentalism

[link]...

Excerpt from del.icio.us/jdw at

Sam Ruby, Comedian

Can you imagine the trouble that could be caused if letter like symbols were allowed in domain names? In the paper he links from there, letter-like symbols, I learned that there are glyphs for “telephone sign” and “facsimile sign.” That is, #x2121...

Excerpt from Cox Crow at

Many ways to skin a char

So my Ratings and Reviews for CPAN feed is broken. It validates alright, but along the way, content that is already UTF-8 gets interpreted as Latin1 and subsequently re-transcoded to broken UTF-8.This happens because my scraper script downloads...

Excerpt from plasmasturm.org at

DevCon: ETCON 2005

[link]...

Excerpt from del.icio.us/tag/rest at

Don’t Panic

Sam,
Be warned - I have to ask a stupid question, sorry. I’ve viewed your presentation online about 50 times (never have been able to make it all the way out to the West Coast conferences) and still don’t get all this Attractive Nuisance stuff. You have impressed on me how important this stuff is so now I’m having nightmares where some monster is munging my content leaving customers hosed. Seriously, I get the whole encoding / code point thing, but I fail to make the neural connections when I’m sitting here staring at an editor. If I want to product content that is valid XHTML which may end up on a website or transformed into some Atom feed, what should I do, not do? Here’s what I mean: should we be typing © or © ?? Are there 8 (or 80) simple rules for dating html?

P.S. Do you know if there’s any audio/video of your presentation(s) available? I’m sure the slides only tell 5% of the story.
Thanks.

Posted by Christian Romney at

Don’t Panic

I don’t know if/when audio will be released.  I do know that these guys recorded my ETCON presentation.

All modern browsers do a decent job on ASCII, iso-8859-1, and utf-8, and wil understand character references, so not much problem there.

When it comes to feeds, the story is quite different.  Aggregators vary widely in what they support.  Sad to say, but if you want to cater to the widest audience, make sure that your entire feed is ASCII.  That means use numeric character references for all characters about 0x7F.  In some cases (RSS 2.0 descriptions, for example), the best advice is to use HTML escaping, and then double escape the result.  For example: &copy.

Unfortunately, you won’t find this in any specification.

Posted by Sam Ruby at

Don’t Panic

Thank you, thank you, thank you. That finally makes some sense. Re-reading your presentation (I finally just cached the thing) it dawned on me that the MS SQL Server where most of our legacy content is stored was probably installed by the admin with the default settings - this means Windows 1252, Case Insensitive, Default collation. I see a lot of ETL in my future. Can’t wait for the multimedia version of your ETCON talk. Thanks again.

Posted by Christian Romney at

Don’t Panic

You can find more information on dealing with win-1252 here.  Once you deal with the 27 annoyances, everything works out pretty simple: the ordinal value of each byte is identical to the Unicode encoding.  I suggest that you test your code with “Iñtërnâtiônàlizætiøn”.

P.S. Here is the script which was used to generate my presentation.  Install it as a CGI, and you can host it yourself.

Posted by Sam Ruby at

Backpack, Google Accelerator, HTTP

As often happens, Sam Ruby saved me from wasting too much time idly browsing the web (no energy to work on more important stuff on Friday night) with his latest post: This Stuff Matters. Sam links to a post by Robert Sayre on the Google Accelerator...

Excerpt from Ludo at

"Just" Use HTTP

Everyone who have gotten “Just use HTTP” as architectural directive should checkout Sam’s stuff-you-can-use slides. A much needed practical guidance amidst never-ending philosophical discussions of REST vs. WS-* “Just” Use HTTP...

Excerpt from Alexis Smirnov at

Tuesday Highlights

Some highlights from the first day of the Etech conference in San Diego: I got stuck in an elevator with Doc Searls, Steve Gillmor, and 7 other people after breakfast. The Applied Minds presentation was awesome. Danny Hillis started by showing some...

Excerpt from Andrew Wooster's nextthing.org at

Backpack, Google Accelerator, HTTP

As often happens, Sam Ruby saved me from wasting too much time idly browsing the web (no energy to work on more important stuff on Friday night) with his latest post: This Stuff Matters. Sam links to a post by Robert Sayre on the Google Accelerator...

Excerpt from Ludoo at

Identity it’s the Future, Can’t You See?

You know for those of us who might be inclined to string garlic and build a coherent arguments against enterprisey architectures and Snakes On A Plane SOAP, it would help enormously to have a Just Use HTTP answer for the wonderful Infocard. Technorati Tags: infocard, enterprisey, REST...... [more]

Trackback from Paul Downey

at

Identity it’s the Future, Can’t You See?

You know for those of us who might be inclined to string garlic and build coherent arguments against enterprisey architectures and Snakes On A Plane SOAP, it would help enormously to have a Just Use HTTP answer for the wonderful Infocard. Technorati Tags: enterprisey, infocard, REST...... [more]

Trackback from Paul Downey

at

[from leonardr] ETCON: ETCON 2005

Sam Ruby’s presentation “it’s not that simple”...

Excerpt from del.icio.us/network/kevan at

GentleCMS Development Log: Part 3

The extract method is basically done. I’m sure it could be improved a bit more, but it seems to be fairly effective. I added a few extra features beyond the original URI class’s capabilities, such as supplying a base uri to resolve...

Excerpt from Sporkmonger at

ETCON: ETCON 2005

[link]...

Excerpt from del.icio.us/zhangyining/samruby+slide at

Unicode and Ruby

I gave a presentation called I18n, M17n, Unicode, And All That at the recent 2006 RubyConf in Denver. This piece doesn’t duplicate this presentation; it outlines the problem, some conference conversation, and includes a couple of images that you...

Excerpt from ongoing at

Backpack, Google Accelerator, HTTP

As often happens, Sam Ruby saved me from wasting too much time idly browsing the web (no energy to work on more important stuff on Friday night) with his latest post: This Stuff Matters. Sam links to a post by Robert Sayre on the Google Accelerator...

Excerpt from Ludo at

ETCON: ETCON 2005

ETCON: ETCON 2005 by nhoizey type:slides encoding unicode utf-8 uri ws-* rest Copy | React (0) [link]...

Excerpt from Public marks with tag rest at

Blogosphäre (aus JavaSPEKTRUM 01/08)

Die Menge der Blogs ist schier unüberschaubar — die Weblog-Suchmaschine Technorati verfolgte bereits letztes Jahr mehr als 57 Millionen. Jede Auswahl daraus ist damit ausgesprochen subjektiv und von den Interessen des Betrachters geprägt...

Excerpt from JavaSPEKTRUM Blogosphäre at

Add your comment