The No.1 i-Technology Magazine in the World !
   
 

      ES no BS
This is a personal blog of Yakov Fain       My son's animations and music         No BS IT podcast      Америчка




Archives

««Sep 2010»»
SMTWTFS
    1
2
34
567891011
12131415161718
19202122232425
2627282930

RSS Feed








Subscribe to these blogs

Search Engines still can't index Flash sites with dynamic content

posted Thursday, 25 February 2010

There are two types of Web applications: those that care about their discoverability and those that don’t.
If you are developing a Web application in Flash or Flex for, say, financial advisers (FA) of a brokerage house or salesmen of an insurance company, discoverability is not a concern because FA’s or salesmen of your firm will be told, “Go to so-and-so URL and use so-and-so application”. We can happily say that both FA’s and salesmen have discover their RIA.

If you are developing a consumer-oriented RIA , you want random people to discover their pizzeria, department store, medical office or a car dealership.  Why a doctor needs a Web site? Mainly to be discovered by people from the neighborhood if someone will google for a family physician. You can say that some doctors also use Web site to post some useful articles or medical form for their existing customers. True. But still, the main reason for investing into development of a Web site is to increase visibility.

Recently, I wrote a blog about iPhone and Flash , and half of this blog was about excellent Google indexing of the plain HTML information about services of our company.
Today, I’ve read an article by Armando Roggio titled “Best practices for Search-Optimized Flash Development” . This article was supposed to give you a feeling that Google really knows how to peek inside your swf file.  The author states, “Need proof? Run this search on Google right now, filetype:swf + “comic books.”

Sure enough, if you’ll do it, the search will return several .swf files that have the words “comic books” hard-coded inside.  But most of the RIA bring the content dynamically and rightly so. A well designed RIA arrives to the client with the code that makes the screen pretty. The information about the upcoming sale of your local Ford dealer or recall of millions Toyota cars is not hardcoded into the Web site, but is dynamically downloaded from the server.  This creates a nice separation of responsibilities – people who create the content of the site don’t have to modify the Web site every time they want to announce a weekend sale at Ford dealerships.

Will Google index and find dynamically loaded content? I don’t think so.

Mr. Roggio quotes in his article executives of Adobe and Google stating how they collaborate and work hard on improving the situation.  I’m sure they do, but so far I don’t see the results.  

That article also states the following:
In fact on June 18, 2009, Google announced that it could load external Flash resources, including text, HTML, XML, additional SWFs, and more. This feature means that you can create a Flash application that draws its content from a structured and external XML document.” And a little later the author states, “As of this past summer, Google can and does retrieve this external files.

I love Google and use it a hundred times a day, but this announcement (if it really was made by Google) is simply not true, and I can easily prove it to you.

Once again, I’ll direct you to the Flash based Web site of our company . Now I’ll reveal some insider’s information.  Our Web site is written in ActionScript 3, but the content of every view you see there comes from external XML files.

For example, if I decide to change the information on the Home view, there is no need to recompile the Web site. I can just use Notepad, TextEdit, Vi, or any other available plain text editor to modify the content of the home.xml file that is being downloded as soon as the compiled SWF file arrives to the Web browser.
Let’s an experiment. The Home view of our Web site starts with the following long statement:

“Do you want your RIA application to be done right the first time? We've built lots of great RIA applications for our clients. Our teams of Flex developers are geographically located in the USA and Eastern Europe. “

Now, copy/paste this long text fragment (with or without double quotes) to Google’s search field. Hit Search. Do you see Farata Systems anywhere in the vicinities? I don’t.
This Web site was deployed about three years ago, and this poor little text we’ve been using in this experiments hasn’t been changed for a couple of years either waiting for any search engine to notice and index it.

You may ask, “Did you use deep linking available in Flex to provide unique URL’s for different views of the in your Web site helping search engines to index the content?” No I didn’t. Our Web site was created in ActionScript long time ago to be as light as possible.

But I know a firm with a complex commercial Web site written in Flex that did use deep linking feature,  but it didn’t help. They are considering re-developing the Web site in HTML/JavaScript. Unfortunately, this won’t help, unless they will put the entire content of the Web site inside HTML (but this is not possible and stupid). Using AJAX won’t make them happy if they decide to keep the content dynamically loaded.
If you’ve developed a large Web application with deep linking Flex feature and have good results in terms of Web analytics, I’d love to hear from you.  

Now the happy end.

I’ll continue recommending Flex and Flash as the platform of choice for developing of the enterprise RIA.  Don’t pay attention to those who predict that HTML5 will kill Flash. It won’t happen for another 10 years, and we need to develop and deploy our RIA today. The fact that someone can afford to pay H.264 patent licensing royalties and stream the video has very little to do with a robust and well written platform for enterprise RIA such as Flash.

But if your cousin Vinnie asks you to develop a Web site for his new Italian restaurant, use Flash very carefully - mainly for embedding interactive widgets here and there.

tags:    

links: digg this    del.icio.us    technorati    




1. Anatole Tartakovskt left...
Thursday, 25 February 2010 4:12 am

Yakov,

  • Developing deep linkage and fooling engines is the only way to go - the best approach I saw to date was by Ted Patrick http://onflash.org/ted/2008/01/flex-and-seo.php compiled with deep linking. However I would be reluctant to offer that to your customers as Google is known for resetting page indexes if they determine to be manipulated. I would recommend your client to contact Google directly and get service agreement on the technique used to present deep linked data and XSLT/dynamic context for crawler

Regards Anatole


2. Yakov Fain left...
Thursday, 25 February 2010 7:50 am

@Anatole I'm familiar with the theory, but still, I'd like to hear success stories with deep linking in real world applications. I want someone to show me what has to be done to ensure that when I enter a 32-word sentence in the search engine, it opens my Flash view that has these words in the same order. I'm just asking to get the same or close results as I'm getting with the search on HTML pages. It would be really helpful if one of the Adobe tech. evangelists created a sample application of the complexity of faratasystems.com with XML-based content demonstrating before-after cases that really improved the visibility of the site. As to your advice to contact Google, I have my reservations. 4 months ago I was attending Flash/SEO presentation at Adone MAX'09. It was done by a savvy Adobe engineer... who admitted that THEY were having hard times getting to understand what Google does in they headless player Ichabod.

But I'll try to follow your advice and suggest my cousin Vinnie to contact Google. Actually, if I'm not mistaken, his favorite search engine is Bing.


3. Anonymous left...
Friday, 26 February 2010 12:18 am

Common denominator for search engines to discover content of a site is still HTML.<br/><a href="http://www.universalmind.com">Universal Mind</a> had a website built using Flex/Flash and their website now no longer requires Flash and their search ranking probably will go higher in due course of time. I am sure search engine discoverability is one of the reason. They started using javascript library jQuery. jquery UI and other javascript libraries like YUI, etc. have matured a lot and provide eye-candy animations. The MINIUSA, Mercedes AMG are unique websites that compromises SEO(discoverability) and user experience using Flash where they inject flash for each relevant HTML page. They have alternate HTML pages and they expose the HTMLs for SEO. For discoverability you need to have many linked pages rather than 1 page with various application states in flash. Most of them are probably using SWFAddress to achieve deep linking and unique URLs even though there is only 1 HTML file. The content should be assembled at the application server tier and have javascript library do the eye candy animation. This means that content can still be published like in your case the way you do today and still have the taste of RIA without Flash for Farata's website. One more advantage is that it will get rendered in a device that doesn't support Flash (iPhone, iPad etc). However if the application demands interactivity with charts rather than discoverability, then HTML is probably not a choice or if you need to explain a story then Flash application/Flash video is a better choice. HTML5 Canvas tag and javascripting would bring in a new dimension. There is one more advantage to use the common denominator as HTML. If your content is rendered for each section of your site as separate HTML pages then you can inject Flash SWF being the UI wrapper to render the same content exposed in the HTML. Now you have best of the both world - search and flash experience for each page. Probably it would be better choice for Farata's website too.


4. Yakov Fain left...
Friday, 26 February 2010 8:42 am

@Anonymous I hear you. What's good for cousin Vinnie is good for everyone :) Having a JSR168 portal with insertions of Flash widgets seems to be the way to go for consumer facing RIA. But how your JQuery or YUI solution will help in indexing of external dynamic content that is being fed to your site? HTML5 is not a standard in Web browser's world and won't be for years. Are you willing to get back and deal with constant browser incompatibilities?