A Django site.
June 23, 2008

Phil Windley
pjw
Phil Windley's Technometria
» Government Data: the Good and the Bad

While I'm at Velocity, Personal Democracy Forum is happening on the other side of the US. David Stephenson was kind enough to send me a slide share of the talk he'll be giving there on government data feeds and visualization. I couldn't help comparing his vision with the reality that Jason Snell writes about in Alameda County: court documents as individually scanned TIFF documents served up in some crappy Java applet. Heh. Some places have a long way to go.

Tags: egovernment web+services

April 23, 2008

Phil Windley
pjw
Phil Windley's Technometria
» Cloud Computing: Dr. Kai-Fu Lee of Google

Main hall where keynotes
were held.   I love the red slip covers on the chairs.  They were
more comfortable than your standard hotel chair.
Main hall where keynotes were held. I love the red slip covers on the chairs. They were more comfortable than your standard hotel chair.
(click to enlarge)

The opening keynote at WWW2008 is Dr. Kai-Fu Lee of Google.

Before the keynote, we were treated to a presentation that featured dancers in blue Spiderman uniforms, a dancer in what I assume was traditional dress, and a guy with a "Welcome to Beijing" banner running through them all. Somehow, it seemed to fit perfectly even though it was the first of it's kind at any tech conference I've been too--especially one that's essentially academic.

We received a welcome speech from Dr. Yong Shang who is the Vice-Minister of the Ministry of Science and Technology. It basically said "thanks for coming, China's pushing forward with Internet technology." No mention of the firewall. :-) As an aside, the fact that I can find him and his ministry on Google in English speaks louder about what he was saying than his actual words. No doubt the Chinese government understands the power of the Internet. That said, in terms of eGovernment, there was mostly information there, not much in the way of services I could see.

The Internet connectivity has failed and we're not even 30 minutes in. Hopefully it will come back up. I was planning on watching Twitter for news of the Pennsylvania primary. The opening ceremony has gone on for 40 minutes now. Finally we're ready for Lee's keynote.

Cloud Computing: Dr. Kai-Fu
Lee of Google
Cloud Computing: Dr. Kai-Fu Lee of Google
(click to enlarge)

He starts out asking what people want. Many of his answers were specifically about accessibility and it's control. There are four key attributes of cloud computing

  1. Data stored in the cloud
  2. Software services are increasingly moving to the cloud and accessed through the browser
  3. Based on standards and protocols
  4. Accessible from any device

Interesting that this is more or less the Google's core set of beliefs. Companies often distinguish themselves from Google in departing from these principles. The world has moved from hardware-centric to software-centric to service-centric.

Six ideas driving cloud computing:

  1. User centric Data is stored in the could and follows you and your devices. Data accessible anywhere and easily, safely shared with others. He mentions several obvious examples of Google services that meet this definition.
  2. Task centric People don't want to make spreadsheets or write documents. Rather they want to plan a curriculum or collaborate on a business plan. Right now, of course, all Google's examples are simply documents or spreadsheet with collaboration built in.
  3. Power Lots of computers in a cloud can do things you can't do with a single PC. Google search is faster than desktop search because there's lots of computers on the task. Cloud computing isn't just about moving things off the desktop, but bring more data and compute power to bear on the problem.
  4. Intelligent Intelligence comes from data mining of massive data. "A ton of data is more valuable than an ounce of algorithm." I'm not sure that says much. Machine translation is a good example where feeding lots of good translation data into a learning algorithm leads better translation of general text. Storage + analytics = intelligence.
  5. Affordable Of course, this all uses a lot of computers and that gets expensive. Google's strategy is to use cheap machines. 1000 CPU PC-Class machines cost about the same as on 64-way high end machine and give 30x the performance (warning: data may be out of date). The actual numbers at Google are even greater since Google builds it's own hardware. Faulty hardware can be overcome with a sophisticated software layer. This is the heart of engineering.
  6. Programmable How do you program 1000's of flaky servers? Fault tolerant distributed disk storage, distributed shared memory, and a new programming paradigm. Google uses GFS for file storage: every piece of data is replicated three times. Anytime a server holding on of the three chunks dies, the others notice and make another copy. The shred memory architecture is Big Table. The programming is done using MapReduce, a way of creating parallel algorithms. Between Mar 2005 and Sept 2007 the number of processes using MapReduce went from around 72000 to over 2 million!

Cloud computing requires new skills. This is very true. We don't do enough to teach these skills to students. We ought to be introducing parallel computing in the cloud as the second programming course--ensuring that the first emphasizes the building blocks for the second. This probably means it's not in Java.

John Breslin has an excellent write-up of this speech as well.

Tags: www2008 wgoogle cloud+computing web+services

April 17, 2008

Phil Windley
pjw
Phil Windley's Technometria
» Google App Engine at the CTO Breakfast

Not Getting Things Done
Not Getting Things Done
(click to enlarge)

There was a pretty big crowd at this morning's CTO Breakfast. Sam Curran had spent some time building an application on Google App Engine, so we had him demo his app and show us the code.

Overall, Google Apps looks like a very nice piece of infrastructure for building Web applications. The database integration with Big Table and Google's authentication platform add some good tools for quickly building applications.

We got into a pretty large discussion of the pros and cons of Google Apps, Amazon Web services, dedicated hosting, and so on. None of these services are directly competitive. They're complimentary in many respects. You could imagine many applications that would make use of all of them.

Speaking of Sam's application: a few days ago, I mentioned to Sam, Bryant and Devlin, that I liked putting things on lists because then I could get them out of my mind and if I lost the list, I never had to do them. A guilt-free way of not getting things done. The problem with online todo lists is they don't forget. I hate that! Sam picked up on that for his app and created a task list for people consumed with the guilt of unfinished tasks: Not Getting Things Done. Just put your tasks on the list and forget about them!

Tags: cto breakfast utah events web+services google python

April 14, 2008

Phil Windley
pjw
Phil Windley's Technometria
» Persistent Storage for Amazon EC2

With Amazon's Web services, you've been able to store stuff in S3 or SimpleDB. You've also been able to fire up as many machine instances as you liked with storage that went away when you shut the machine down. Anything you wanted saved better be in a database somewhere else, or you had to painstakingly copy it out to S3 yourself. Last night Amazon announced persistent storage on EC2. Now you can create disks in S3 and attach them to EC2 instances. You want a terabyte of storage for your machine, just create it in S3 and mount it.

Another feature rolled out last night is snapshots. Need backup or the ability to rollback? Snapshot the instance and it's on S3, ready to use. You can create new volumes from any particular snapshot.

These two features make Amazon's grid computing platform a very nice place for startups to experiment, develop, and build out. All with little or no capital cost.

Tags: amazon web+services

December 14, 2007

Phil Windley
pjw
Phil Windley's Technometria
» Amazon's SimpleDB

I just posted piece at Between the Lines on Amazon's latest announcement: SimpleDB, a database service in the cloud. I gave it the title "Economics that are impossible to stop" because that what I think Amazon's doing: changing the whole economic model of how people build large scale distributed applications.

Tags: amazon web+services databases cs462

» Amazon's SimpleDB

I just posted piece at Between the Lines on Amazon's latest announcement: SimpleDB, a database service in the cloud. I gave it the title "Economics that are impossible to stop" because that what I think Amazon's doing: changing the whole economic model of how people build large scale distributed applications.

Tags: amazon web+services databases cs462

December 7, 2007

Phil Windley
pjw
Phil Windley's Technometria
» Google Chart API

test chart

Google has released a chart API that returns PNG files from an HTTP GET. The following types of charts are available:

  • Line chart
  • Bar chart
  • Pie chart
  • Venn diagram
  • Scatter plot

The chart to the right was created using this URL:

http://chart.apis.google.com/chart?
  cht=p3&
  chd=s:Uf9a&
  chs=200x100&
  chl=A|B|C|D

Adding charts to Web sites just got a lot easier.

Tags: google web+services

September 12, 2007

Phil Windley
pjw
Phil Windley's Technometria
» Syndication Oriented Architectures

Two of the people I respect the most, Jon Udell and Rohit Khare are together in one podcast: Jon's latest from his weekly Interviews With Innovators podcast on IT Conversations. Jon has a short write-up on his blog about the podcast and it's topic: syndication oriented architectures.

SynOA was born on the open web and is now creeping into the enterprise. To understand why, just consider Facebook. It is a deeply syndication-oriented application. Although Facebook users never have to think about it in these terms, they are constantly publishing events onto a syndication bus while at the same time subscribing to aggregated feeds published by their friends. As a result, they're effortlessly yet comprehensively aware of a large number of summarized event streams. Rohit Khare thinks that syndication-oriented architecture will enable business users to achieve that same kind of awareness.

Good stuff. Rohit has a white paper on SynOA at KnowNow (registration required). It's worth reading to get the meat of what he's talking about.

Tags: itconversations soa web+services rss syndication