I came across a site that does something very similar to Google Suggest. When you type in 2 characters in the search box (e.g. "ca" if you are searching for "canon" products), it makes 4 Ajax requests. Each request seems to get done in less than 125ms. I've casually observed Google Suggest taking 500ms or longer.
In either case, both sites are fast. What are the general concepts/strategies that should be followed in order to get super-fast requests/responses? Thanks.
EDIT 1: by the way, I plan to implement an autocomplete feature for an e-commerce site search where it 1.) provides search suggestion based on what is being typed and 2.) a list of potential products matches based on what has been typed so far. I'm trying for something similar to SLI Systems search (see http://www.bedbathstore.com/ for example).
                 
                                                                            
                            This is a bit of a "how long is a piece of string" question and so I'm making this a community wiki answer — everyone feel free to jump in on it.
I'd say it's a matter of ensuring that:
- The server / server farm / cloud you're querying is sized correctly according to the load you're throwing at it and/or can resize itself according to that load
- The server /server farm / cloud is attached to a good quick network backbone
- The data structures you're querying server-side (database tables or what-have-you) are tuned to respond to those precise requests as quickly as possible
- You're not making unnecessary requests (HTTP requests can be expensive to set up; you want to avoid firing off four of them when one will do); you probably also want to throw in a bit of hysteresis management (delaying the request while people are typing, only sending it a couple of seconds after they stop, and resetting that timeout if they start again)
- You're sending as little information across the wire as can reasonably be used to do the job
- Your servers are configured to re-use connections (HTTP 1.1) rather than re-establishing them (this will be the default in most cases)
- You're using the right kind of server; if a server has a large number of keep-alive requests, it needs to be designed to handle that gracefully (NodeJS is designed for this, as an example; Apache isn't, particularly, although it is of course an extremely capable server)
- You can cache results for common queries so as to avoid going to the underlying data store unnecessarily