Hierarchies and eBay

transcript

Hierarchies, also called taxonomies, directories, and subject directories, are one of the

most familiar ways to organize information. From directories on the web such as Yahoo!

to the local yellow pages, users are well versed in how to locate information using a

hierarchical construct. A hierarchy is a system “of usually manually constructed

categories through which the use can browse starting from broader categories through

which the user can browse starting from broader categories and navigating down the

hierarchy (directory) to more specific ones” (Dennis, Bruza, and McArthur, 2002 ).

Generally speaking, a hierarchy is an organization scheme that establishes parent child

relationships between items in a list, where all children have all or most of the

characteristics of the parent. With this scheme, users can predict what items will be

contained in the main categories by determining the characteristics of the parent and

assume that all children will be more specific and unique in nature.

Hierarchies have been around for thousands of years and appear prominently in the

works of Aristotle and early scientists. The idea of constructing the hierarchy revolves

around two ideas: information can be organized in many ways but the two most common

ways hierarchies are constructed revolve around consensus of education or scientific

consensus. Respectively, these organize information around the manner in which they

are taught in education or in the way the subject is approached by specialists in the field

(Spiteri, 2000 ).

As the web has evolved into a very large, unorganized information source, many have

tried to construct hierarchies to provide access to users. Directories such as Yahoo! and

the Open Web Project have experienced various degrees of success and continue to

evolve. More importantly however is the use of the hierarchy as a navigation tool on

discrete sites. Hierarchies are many times embraced by designers because of their

familiarity to users and their ease of use. However, construction of hierarchies is usually

manual and can be time-consuming as well as expensive to create and maintain.

One electronic commerce site that has received attention from media and investors

alike is the auction site eBay. eBay is a marketplace where sellers post their items on the

site and buyers bid electronically on items. The items for sale are organized into a preset

hierarchy where sellers place their items according to their own determination. Buyers

can then browse through the categories of the hierarchy looking for items or can use the

simple search engine to retrieve items. The hierarchy itself contains twenty-four top level

categories with varying degrees of depth in each category. For example, the top level

category “Books” has ten second-level classes and each of those classes have anywhere

from four to thirty categories, with many of these having subcategories as each topic

becomes more and more specific.

The hierarchy is constructed with pre-defined categories by eBay staff. Sellers, when

posting the description of their item are asked to categorize their item in one of the eBay

categories and can pay and additional fee to have it listed in more than one category. The

hierarchy has undergone much transformation over the past years from twelve main

categories in 1997 to its current twenty-three. Categories that included many different

types of items such as “Books, Movies, and Music” have been separated into distinct

categories and new categories have been added to reflect new items such as automobiles

and real estate for sale in later iterations of the site.

This transformation indicates that eBay is continually monitoring the use of the

hierarchy and making adjustments where it sees fit. The importance of the hierarchy on

eBay is two-fold. The hierarchy can enhance browsing and lead buyers to serendipitous

discovery of items to buy or can find items of a similar nature to ones they may already

be interested in buying. The hierarchy, because of its straightforward nature and

familiarity to users can also serve new users who may not know enough about the site to

effectively use the search engines and their filtering abilities. Taxonomies can provide “a

means for designing vastly enhanced searching, browsing, and filtering systems. They

can be used to relieve the user from the burden of sifting specific information from the

large and low-quality response of most popular search engines (Chakrabarti, Dom,

Agrawal, and Raghaven, 1998 ).

Literature on Hierarchies and Browsing

Because browsing behavior is closely tied to the use of subject hierarchies, a

review of the literature of browsing as an information retrieval strategy can shed light on

why people use hierarchies and how they use them. With these studies, designers can

determine the best ways to construct these tools to meet the needs of the browser. In

addition to studies of user behaviors, much research has been conducted to effectively

create meaningful hierarchies automatically to reduce the time and expense generally

associated with constructing a subject hierarchy.

Browsing is an activity that has increasingly been studied as the World Wide Web

has emerged as a popular, accessible information tool. In 1997, browsing accounted for

61.27% of web activity (Fu and Salvendy, 2002). Browsing has been defined in many

ways in context of its purpose and its nature by many in the information science field.

Marchionini and Schneidermann (1988) who have done extensive studies of the user

interface and browsing define the activity as “an exploratory, information seeking

strategy that depends on serendipity. Elaine Toms (2000) defines browsing as ‘an activity

in which one gather information while scanning an information space without an explicit

purpose.” Strategies are heuristic and require users to recognize relevant information

through scanning and for determining interesting words which catch the eye (Cothey,

2002 Bates, 1996 ). O’Connor (1993) believes that the subject category does not support

browsing because ” browsing consists of a wide spectrum of idiosyncratic processes for

searching, sampling, and evaluating documents when significant attributes of a target or

goal are not full articulated or evident. When the librarian puts ‘related’ works nearby on

the shelf, they are said to be supporting browsing. Yet, by determining which

connections establish relatedness, they are supplying the pasture and tending to the user;

they are supporting grazing.” While scholars may differ on the particular nature of

browsing, many consistently cite user awareness of the information environment or the

nature of the task as being the underlying motivation in browsing as a user strategy

(Marchionini and Shneidermann 1988, Fu and Salvendy 2002, Toms, 2000, Cothey,

2002, Chen, Houston, Sewell, and Schatz, 1998, Chang and Rice, 1993, Qui, 1993).

Problems with browsing have also been well documented in the literature,

including missing pertinent items, longer search paths, missing pertinent items, and a

sense of feeling lost in the information space. Although some researchers believe that

hierarchies and browsing make “access easier and less-time consuming” (Griffiths,

1999), many studies have shown that the search time is longer than use of analytical

search tools such as search engines (Fu and Salvendy, 2002 ). Dennis et al. (2002)

contend that “direcctory based search using Yahoo does not seem to offer increased

relevance over keyword-based search and also takes longer .” Getting lost in hyperspace

is a well-documented phenomena where the user “does not have a clear conception of

relationships within the system, does not know his present location in the system relative

to the display structure, and finds it difficult to decide where to look next within the

system (Elm and Woods, 1985 ). Because of this disorientation, the user may experience

art museum phenomenon which is defined as “spending a great deal of time while

learning nothing specific” (Carmel, Crawford, and Chen, 1992 ). For new users to a

system, the likelihood they will miss pertinent information is demonstrated in the fact that

“users browse a selected topic using a depth-first search and miss other links at higher or

lateral levels” and that users familiar with the system may miss new items (Lai and Yang,

2000 ). Many designers have responded by providing concept maps, breadcrumb trails,

and site maps. These tools are used often when browsing and new users tend to use them

more than experienced users (McDonald and Stevenson, 1998).

Marchionini (1995) summarizes the limitations of browsing as follows: high

attention demand, inefficient for well-defined retrieval, possible distraction, possible

information overload, influences of various biases and cognitive inertia, systems not

designed to assist browsing, and subject to diminishing returns. Because browsing is

highly dependent on scanning the information environment, fatigue is also a major

limitation of browsing. Marchionini points to earlier studies by Wiberley and Daugherty

and Lee and Whalen where the upper limit of items users can browse without fatigue has

an upper limit of 200 items. In Lee and Whalen’s study of an automated mug shot

system, researcher’s concluded that accuracy fell rapidly between 100 and 200 images.

In addition to fatigue, browsers also fall prey to distractions and information overload, an

area where Marchionini believes is one of the areas where useful tools can be designed

for minimizing these problems. In an e-commerce setting, however, this remains to be

seen if this is an actual goal of the online retailer who may want to encourage impulse

buying and display his products strategically as he may in a physical store, designed to

catch the buyers eye and increase sales.

The hierarchy as a tool to assist browsers has been around for centuries. Much of

our classification in a hierarchy dates backs to Artistotle’s genus-species systems where

all species must contain the characteristics of the genus (Spiteri, 2000). Because of this

long history in information organization, many users are comfortable with this schema.

As Chen, Houston, Sewell, and Schatz (1998) note “browsing is frequently used in new

or relatively unknown information spaces, users typically rely on pre-existing mental

models of information organization as they explore. The most common mental models

are either alphabetically-based or based in hierarchical categories.” Kim and Allen

(2002) support the importance of the hierarchy as an information retrieval stating,

“Subject directories appear to be crucial to flexibility in information retrieval and helps

users with different characteristics and tasks find information they want.” Withstanding

all of the problems in browsing, the hierarchy is still a dependable tool that new users and

people unfamiliar with the search environment will find invaluable.

However, the goal of the designer should be to design the hierarchy in such a way

that the user intuitively understands the structure and will have little need of aids.

Drawing from the field of human-computer interaction, there are many guidelines offered

in the construction of hierarchies to make them more user-friendly and effective. The

breadth and depth argument regarding menu structure could possibly shed light on

designing hierarchies but there is inconclusive data to suggest that there is an optimal

formula for breadth of categories and depth of the hierarchy (Parkinson, Hill, Sisson, and

Viera, 1988). In most studies however, breadth is usually favored over depth, as long as

display crowding does not occur (Larson and Czerwinski, 1998). User studies with web

hierarchies of varying breadth and width still support the theory that breadth is better as

users in hierarchies with three levels performed poorly in experiments. However, the

same users also performed only slight better and complained of feeling lost in hierarchies

that had 32 categories at the top level (Larson and Czerwinski, 1998). In a large

hierarchy such as eBay or Yahoo!, it would be difficult to adequately represent the

information available by restricting the number of top categories and the depth to which

these categories could increase specificity of the topic.

There are characteristics that a well-defined hierarchy should have. Categories

should be mutually exclusive and homogenous (Spiteri, 2000 ). This means that

categories should not overlap when possible and that all items in a category should be

alike and share the characteristics of the class. Interfaces should “support scanning and

have good textual affordances. They must also provide salient cues that encourage

meandering and diversion which for ultimate success will facilitate serendipity” (Toms,

2000 ). Browsers must readily recognize topics and should be designed with novices in

mind and provide topics related to everyday, common sense knowledge (Carmel,

Crawford, and Chen, 1992).

Overall, the designer must keep in mind the needs of the user. As previously

stated, the traditional browser is either a novice searcher or does not have a well-defined

task in mind. As a result, the interface must recognize “the inherent ability of the human

eye to spot interesting words. The eye is much better than a computer for intellectually

scanning a mass of information, making associations among related or synonymous

topics, and choosing those of interest for further perusal (Bates, 1996 ). In the design,

the developer must design with visual recognition and spatial reasoning in mind as

opposed to linguistic specification (Chang and Rice, 1993 ). From the world of

information architecture come the recommendations that designers “recognize the danger

of overloading users with too many options, group and structure information at the page

level, and subject your design to rigorous user testing (Rosenfeld and Morville, 2002).

E-commerce activities at eBay also present additional problems in the sense that

topics generally do not lend themselves to one of the consensus model of education or

science. Categories are based loosely on products and services for sale that do not readily

align themselves with organization structures of traditional information retrieval

hierarchies. Miles, Howes, and Davies (2000) warn that “e-commerce should not be

assumed to be information retrieval” and that the interface must meet the needs of

consumer behavior which consist of need identification, product brokering, merchant

brokering, negotiation, purchase and delivery, and service and evaluation.

In his study of “The Dynamics of Mass Online Marketplaces”, Jungpil Hahn

(2001) discusses his idea of “market navigation design” which is a set of rules derived

from information architecture for a market setting. He discusses the aspects of breadth

versus depth, noting that fewer clicks with a breadth has the advantage of leading the

buyer quickly to a product but may result in more items meeting the buyers needs and

lead to information overload. Hahn also discusses the construction of hierarchies in

terms of the ability to list one item in more than one category. The advantage being an

increased likelihood of the buyer finding the single product versus confusion on the part

of the buyer due to inconsistent categorization.

In their study of browsing in a multidimensional framework, Chang and Rice

(1993) note that Bloch and Richins define browsing differently from a consumer behavior

point of view as “the in-store examination of a retailer’s merchandise for informational

and/or recreational purposes without an immediate intent to but. Consumer browsing

may be pleasurable in itself and can be done for various reasons. Again the impetus for

browsing seems to be a lack of goal-oriented search motivation and takes on an aspect of

recreation, instead of information retrieval.”

The results of browsing in an e-commerce situation are similar to those in a

traditional information system with minor differences. Besides enjoyment, browsers may

gain product knowledge and succumb to impulse buys (Chang and Rice, 1993 ).

Shoppers may also encounter new products whose existence of which they were unaware.

Whether these results do or should have any effect on the user’s experience with the

hierarchy remains to be seen.

Of further concern to those constructing hierarchies for web development is the

amount of time and personnel needed to create and update hierarchies manually. Chen, et

al (1998), argue for automatic means for constructing hierarchies because the “process of

creating the categories and connecting homepages to them is manual, slow, and

cumbersome”. Chen (2002), who has conducted research in creating classification

systems based on previous user activities notes that manual classification “can deal with

different types of data and can deal with different types of data and has high accuracy”.

However, he also notes that the “drawback of manual classification is the inefficiency in

dealing with a large number of objects in a complicated category structure” (Chen 2002).

Because of the exponential growth in the volume of on-line textual information it is

nearly impossible to maintain such taxonomic organization for large, fast-changing

corpora by hand (Chakrabarti, Dom, Agrawal, and Raghavan, 1998) because the

“semantic judgements required to select quality resources and the cognitive processes

necessary to classify and catalogue resources all take time (Worsfold , 1998). Another

issue to be considered is that the “human editor’s decision on taxonomy construction is

not only highly subjective but also its subjectivity is inconsistent over time (Kim and

Allen, 2002 ). Marchionini (1995) agrees by stating that “cognitive inertia, which is a

tendency to continue following paths or lines of evidence rather than examining contrary

or alternative directions” may exist. In response to this need to automate, there have

been many attempts at creating systems that will generate and classify documents into

hierarchies. Although these are many times applied to the results of a query or to a small

set of pre-defined documents, they are worth investigating.

Sanderson and Croft (1999) defined five principles of design that their

automatically generated hierarchy must contain: terms had to reflect the topics covered in

them, the organization had to be such that a parent term would refer to a more general

concept than its child, the child would cover a sub-topic of the parent, a strict hierarchy

where there was a one to one relationship between child and parent was not considered

important, and ambiguous terms would be expected to have separate entities in the

hierarchy. Selecting terms for the hierarchy if they are not the result of a query can be

problematic. According to Lawrie and Croft (2003), “topical terms are the content

bearing terms with respect to a particular topic. These are not necessarily the most

frequent words”. Instead they are terms which are “both about the topic and predictive of

other terms” (Lawrie and Croft, 2003).

Completely automated generated hierarchies generally fall into two categories,

those created through subsumption and those that are created lexically. (Lawrie, 2000 ).

Another approach to automatic classification is the automated analysis of documents filed

into pre-existing categories of a manually constructed hierarchy using Bayesian

classifiers. A drawback to this method is that they are not adaptable to varying interests

or to changes in the document collection (Lawrie, 2000). The other methods aim to

display the contents of a document collection and remain flexible.

Subsumption is the notion that some terms will frequently occur among

documents, while others will only occur in a few documents and that some of these terms

will be terms that broadly define a topic and that some will co-occur with a general term

and explain aspects of a topic (Lawrie, 2000 ). Terms are selected that are ‘unusually

frequent’ and sorted so that the top scores are designated for use in the hierarchy (Lawrie,

2000 ). The assumption here is based on Forsyth and Rada’s hypothesis that the more

documents a term appears in the more general the term is assumed to be (Nanas, Uren,

and DeRoeck, 2003). Once all of the subsuming relationships are discovered, the

hierarchy is constructed in a bottom-up fashion (Lawrie and Croft, 2001 ). Redundant

subsumptions and relations where terms only co-occur two or fewer times are eliminated.

Lexical hierarchies rely on the lexical nature of documents and relationships

between phrases and noun-adjective relationships are the basis for this method of

automatically creating a hierarchy. Nevill-Manning, Witten, and Paynter (1999) created a

lexically-generated hierarchy depends highly on the lexical notion of keywords in

context. Users are presented instances of phrases in which their keyword appear and can

browse from that point forward. However, because of the large amount of information

returned, the authors had concerns for it availability on the web due to design factors and

download times.

The lexical dispersion hypothesis states that “a word’s lexical dispersion-the

number of different compounds that a word appears in within a given document set- can

be used as a diagnostic for automatically identifying key concepts of the document set”

(Lawrie, 2000). Once phrases are identified and ranked and anomalies are identified in

documents that use a phrase frequently that could potentially skew the importance of the

phrase, the hierarchy is constructed in a top down manner. High level terms are selected

as the top level of the hierarchy and terms that occur with these terms are then ranked

according to the frequency in which they occur. (Lawrie, Croft, and Rosenburg, 2001 ).

Other research focuses around automatically classifying items into a preset

hierarchy. Many times the hierarchy is created manually to represent the information

space and various clustering algorithms are used to place the item in the correct level of

the hierarchy. One novel approach to this problem was introduced by Chen, LaPaugh,

and Singh (2002). These researchers analyzed access behaviors of users to determine the

paths they took through the sites in a one day period. The assumption was made that

pages that users traversed or visited were somehow related. Taking this information with

the known structure of the hierarchy, researchers were able to assess the functionality of

their hierarchy as well as predict topic placement based on the user patterns of a group of

users who had accessed the object.

The problems with automatically generated hierarchies generally result from the

size of the document set being processed and the nature of the algorithms chosen.

Subsumption seems to be the more effective tool for creating hierarchies and is used in

research of Croft, Lawrie, Roesenberg, Chakabarti, Nanas and others. Some of the

problems associated with subsumption is it inconsistency (Lawrie, Croft, and Rosenburg,

2001 ). Subsumption paid little attention to themes and documents and can be

demonstrate by a drop in relevant documents contained in the hierarchy (Lawrie, 2000).

Lexical hierarchies may possibly be skewed in the selection of topical terms

because the occurrence of terms in a long document may skew the importance of the

word in relation to the entire document set (Lawrie, 2000). In addition to some words

appearing more frequently in a long document, the idea that some words naturally occur

more often in phrases and sometimes result in useless or low-meaning phrases (Lawrie,

2000). Overall, “the strength of subsumption lies in separating documents into small

groups, whereas lexical hierarchies do a much better job of including all documents in the

hierarchy” (Lawrie, 2000). Other concerns exist in the computer’s ability to determine

specificity of a topic and to stop at a broader level of the hierarchy if necessary for

classification (Nanas, Uren, and DeRoeck, 2003).

The Current Hierarchy on eBay

The current hierarchy on eBay serves to facilitate browsing the items for auction

on the site. As an e-commerce site, eBay makes most of its revenue by providing access

to the products of sellers to a wide audience of buyers. As a result, the effectiveness of

its information retrieval devices in the form of its search engine and subject hierarchy is

of paramount importance. If buyers cannot find the items they wish to purchase, fewer

bids are placed, fewer sellers will list their items, and eBay revenue will fall.

The hierarchy serves two purposes that the literature has clearly defined. It can

support the novice user who may be uncomfortable constructing search phrases for the

search engine and using the various filters available in the advanced search area to

narrow the results of said queries. Novice users may be more comfortable browsing

through the categories of interest to get a feel for the information space and to see the

breadth of items available for purchase.

The other purpose the hierarchy may satisfy is the buyer without a specific goal in

mind. This might be a buyer who is “window shopping” as in a traditional brick and

mortar store, with no purpose of buying, but looking for serendipitous discovery,

pleasure, or for product information gathering. In this case, the searcher could be both

potential sellers and buyers.

The hierarchy itself does not follow one of the classic modes of hierarchical

organization as Spiteri (2000) predicted. There is not an apparent scheme for organizing

a very broad array of products for sale in an educational or scientific way. Instead, eBay

has picked broad categories of items for sale and put those at the top level. Sublevels or

categories maintain the parent-child nature of a hierarchy and are constructed mainly on

an increasing degree of specificity as one meanders down a hierarchical path. Main

topics are arranged alphabetically and currently number twenty-three.

The structure of the hierarchy is determined by eBay personnel and a request for

insight into the criteria for inclusion as categories was not answered. An area to request

new categories exists in the help section but there is no indication given how requests are

evaluated and eventually incorporated into the hierarchy or ultimately discarded. One

would surmise that as categories grew in membership they would be analyzed for new

subcategories. However, this is difficult to confirm since many of the lowest levels of the

hierarchy still have a large number of entries. For example, the category

BooksMysteries and ThrillersDetectivesWomen Sleuths still contains over 2,000

items. An analysis of this path in the hierarchy suggests that the addition of new

categories is haphazard and directly related to user requests. Other subcategories in the

detective path include two other options, short stories and other. There is no relation to

each other and distinguishing where an item might fall is difficult. In this example, an

book of short stories featuring female detectives would be difficult to classify. At this

low level of high specificity, one could argue that the effectiveness of this level is

compromised if an item is place in two of the three options.

When analyzed in relation to Spiteri’s model for hierarchies (2000), eBay falls

short in many of the areas of constructing an effective hierarchy for e-commerce. eBay

generally fails to provide homogenous categories, mutually exclusive categories,

consistent structure, and provide very few tools for comparing products. The hierarchy

seems to be haphazardly constructed without little thought to the principles of designing

hierarchies. Because eBay is not burdened by the tasks of placing documents in the

correct level of the hierarchy, there should be more attention paid to its construction for

more efficient browsing.

The main issue where homogeneity fails in the ubiquitous “Everything else”

category. Second level topics such as genealogy, health and beauty, metaphysical, and

personal security have little to do with the other. It is in this area that the hierarchy

provides no context to the buyer and categories such as “Test Auctions” are difficult to

define and determine the purpose. Because the rule of homogeneity has been ignored,

this area is really just a chaotic mish mash of items and do not benefit from the many

positive attributes of hierarchical browsing. Within this category there are also

meaningless category headings that do not have well defined characteristics for

membership in the class. The best example being “Weird Stuff”. This vague labeling

combined with its location in a poorly constructed node of the hierarchy make it difficult

for the user to predict what items or subcategories may be included.

Exclusivity of the categories in eBay is one of the main drawbacks to the current

structure. Because there is no apparent plan for designing the categories of the hierarchy

and for incorporating new categories, the problem of exclusivity plagues the hierarchy at

all levels. Examples of this problem are easily identified at the top level of the hierarchy.

One might wonder how dolls and bears is an exclusive category of toys and hobbies. The

existence of a top level categories for stamps and one for coins is also confusing. In

relation to the other categories at this level, one specific item that is widely collected

seems incongruous with the rest of the organizational scheme. Stamps and coins may fit

better in categories such as collectibles or hobbies. Upon seeing these inconsistencies in

labels, the user may be confused as what to expect in the hobby section if stamps and

coins are not included.

Vague labeling also results in categories that appear to be inclusive or

heterogeneous. Categories such as “Collectibles” give no indication of what might be

included, especially when other categories at the same level such as antiques, stamps,

coins, toys and hobbies, dolls and bears, and pottery and glass seem to over a wide area

of what might be expected to fall in the collectible category. Because the hierarchy is

arranged alphabetically, a novice browser may scan the hierarchical tree and see

“Collectibles” first and never get to the lower levels where collectible items like

glassware and stamps are included.

Along with the vague labeling is an inconsistency of specificity at the different

levels. As mentioned before there is a contrast in specificity between categories such as

the generic collectibles and the specific collectible stamp. Other category names are even

more broad and troublesome for the novice user or the browsing buyer. Under the home

category one might be surprised to find pet supplies and welding but not health and

beauty. One might be surprised to find items such as sewing machines under “Business

and Industry” instead of “Clothing, Apparel, and Accessories”, which incidentally does

not include the most common category of accessories, jewelry.

From a usability standpoint, eBay attempts to give browsers a glimpse of the

information space in which they can shop. The broad categories do invite users to

explore the possible items for sale in the hierarchy. However, the presentation of the

hierarchy and its apparent usability could potentially affect the browser’s satisfaction as

predicted by Fu and Salvendy (2002).

In addition to labeling problems there are other problems with the usability and

organization of the hierarchy. The space allocated to the hierarchy is not sufficient to

allow for ease of scanning which is so important to effective browsers as discussed

earlier. Only one-fourth of the screen is dedicated to the hierarchical listings. In this

space that measures approximately two inches by eight inches on a monitor set at

800x600, eBay has inserted the twenty three top levels as well as a random sampling of

subcategories numbering thirty eight. As a result, the text is very dense, difficult to

recognize as separate entities in a glance, and the relative font sizes used to accomplish

this density of information are hard to read. From a design standpoint, this does not

encourage “the inherent ability of the human eye to spot interesting words”(Bates, 1996) .

Because of the manner in which the hierarchy is presented there are additional

concerns with the usability of the site. As mentioned previously, eBay has also included

random subcategories underneath the main categories on the front page. It is difficult to

determine what criteria were used to choose these subcategories that receive more

prominent display than other categories at the same level of the hierarchy. This

inconsistency could lead novice users to miss the other subcategories further down the

hierarchy or to become frustrated when it appears that the item they are looking for

doesn’t fit into one of these lower categories that are displayed on the front page.

The final usability issue to be discussed is the problematic way in which eBay has

presented the lowest levels of the hierarchy. Once a user picks a main category from the

front page of the site, they are presented a page with all of the second-level categories.

When a user selects one of these options, they are then presented with the items that fall

in that level of the category. Because this includes all items that stop at this level of

specificity as well as all of the children items that are inheriting the parent class, the

number of items returned is immense. The resulting set is usually too large to navigate

easily. What many users might not notices is that eBay has continued the hierarchical

structure for the category in the right column under the search box. Because this area is

so cluttered with search options, the more specific subcategories are not even viewable on

the first screen of a standard monitor set at 800x600. The user would have to scroll down

to see the continuation of the hierarchy and many novice users will not know to do this

and will lose the power of the hierarchy as an information retrieval tool.

There are some things that eBay has implemented to address problems with

browsing and hierarchies. At all lower levels, breadcrumbs are provided where browsers

can easily jump up one or two levels in the hierarchy without having to return all the way

to the homepage. This represents an attempt to keep the user from feeling lost which is

well documented in browsing literature. An overview map also provides browsers a

limited view of the top levels of the hierarchy with more detail than the frontpage for

constructing a better mental map of the entire information space. Also eBay must

recognize some of its fallacies in the construction of its hierarchy because on most of the

pages where second levels of the hierarchy are presented, they also provide a listing of

“Related categories” to encourage browsers to discover items in other areas that might

rightly belong in the category they are currently viewing. For example, if a user selects

Clothing, Shoes, and Accessories as the main topic, the related categories at the bottom

of the page would also direct them to other categories such as Jewelry and Health and

Beauty.

Because eBay does not manually classify all of the items in the database using

their own personnel, the need to automate classification schemes is not as important to

the company as it may be to other internet based hierarchies. The time issues is

adequately solved as sellers provide the category for the item as part of the process of

listing it on eBay. However to improve the quality of product placement by sellers, eBay

may consider some of the algorithms for testing the quality of its hierarchy or offer

assistance to sellers in deciding which category may be best for a seller, especially novice

sellers who may not be familiar with the entire scope of the selling environment. Chen’s

analysis of previous user traffic to certain categories or document clustering techniques

used in Lawire’s (2000) discussion of hierarchies may provide insight into the

organizational scheme of eBay. Once the seller has submitted the description of their

item for sale, an analysis of the content could be compared to other items in categories or

to model items in categories for suggested placement. From eBay’s standpoint this also

may be a money making idea as it may encourage sellers to list their item in multiple

categories as they are being presented potential selling areas they had not previously

considered.

On a related note, the many automated techniques for creating hierarchies could

potentially be applied to improve granularity of the lower levels of the hierarchy. Even

at the lowest levels, there are still large membership sets for certain levels. For example,

in the category of Toys & Hobbies > Action Figures > Star Wars > Vintage (1977-89)

>Return of the Jedi > Figures, there are still over 500 items for sale in this very specific

category. From a browsing standpoint, this number may still overwhelm the user and

they may not be able to find the item they were looking for. eBay could potentially

expand the depth of the hierarchy but this may make it more problematic for sellers to

place items in the correct categories or discourage browsing buyers from continuing

down a long path. Also maintaining these categories at this level of detail could also be

time consuming and expensive for eBay. For a person to sit down and derive categories

at this level, say by character name, would require quite a lot of attention to detail and

expand the structure of the hierarchy significantly. However, eBay could employ one of

the automated hierarchy schemes to group items at the low, detailed levels for those

searchers who do have more specific needs. Either through clustering or lexical analysis,

users could be presented with additional navigation aids that generate new categories by

term clustering. So in this example, the automatically generated hierarchy might contain

a list of characters’ names that appear in descriptions. This could also be added as an aid

to the search engine where users would get a hierarchical listing of categories their search

terms were contained in.

Improvements on eBay

In response to the literature and some of the problems previously discussed, eBay

needs to dedicate the time and the money to revamp its hierarchy and optimize it for new

browsers and for recreational shoppers. Organizational changes as well as usability

issues need to be addressed and finally the new interface needs to be tested for usability.

Organizationally speaking, eBay needs to refine its hierarchy to meet the two

important aspects of hierarchical design, exclusivity and homogeneity. Categories at the

top level that have established child relationships to categories at the top level must be

subsumed by the parent category. According to Patricia Billingsley (1982), “problems

created by ambiguity in menu choices are compounded when users are unaware of the

hierarchical structure of data organization”. To make the structure less ambiguous, the

hierarchy should adhere to the strict parent/child relationships that are established in the

organizational structure.

The second recommendation would be to make the entire structure of the

hierarchy viewable in the “View all eBay categories” section or in the site map. All of

the categories do not need to be on one page, but users should have the ability to browse

through the entire depth of the hierarchy assigned. The idea that a seller places the item

in a lower category in the hierarchy where the user might never encounter due to the

constraints of the design. If eBay has determined that the hierarchy is necessary to

facilitate browsing and window shopping behaviors, then it needs to make the entire

information space accessible through the hierarchy.

Once these changes to the organizational structure have been user testing would

need to be conducted to ensure that the recommendations from the literature are effective

in this environment. Because browsing behavior is difficult to study and quantify

because of its lack of directed goal, the design of the study would need to be designed to

emulate real browsing situations. Elaine Toms (2000) notes that “few studies have

assessed browsing, analyzed the characteristics or factors that influence browsing,

experimentally manipulated the browsing environment or attempted to measure browsing

outcomes.”

One possible way of studying the changes to the interface would be to have users

spend a set amount of time browsing for items they might be interested in buying,

Instruct them that there is no goal in mind, simply to spend time looking at items as if

they were “just looking” in a store. With two groups of browsers, one group could use

the old interface and one group could use the new interface. At the end of the study, both

groups could answer a survey on user satisfaction centered on the ideas of apparent

usability and compare results with those of Fu and Salvendy ( 2002 ).

Another approach would be to conduct a traditional user study and ask one set of

users to browse certain categories for certain types of items in each of the interfaces and

ask them to speak their thoughts as they navigate through the structure. Comments

would be recorded by investigators and analyzed for information at a later date. Also

times would be recorded for each time a user encountered an item they might consider

buying. Because this is highly subjective to individual differences, a large sampling of

users would be needed to get a better idea of how effective this benchmark would be.

From a different perspective another interesting aspect to investigate would be to

tests the cognitive models of different hierarchies and see if hierarchies in an e-commerce

setting were negatively affected by the lack of a mental model that is many times

establish by the models of education and science as discussed by Spiteri ( 2000). Users

could be given a specific item to locate in each of the hierarchies at the same depth in the

hierarchy. For example, the search task may be something like find a website on the

platypus using the Yahoo! directory and a DVD of a pop band on eBay. Both of these

items are five levels deep in the hierarchy. The same user will be asked to browse

through the hierarchy with the loosely designed search task and users will be evaluated

on time and accuracy in identifying the resources requested. Results will be compared to

see which hierarchy performed better and qualitative data from user responses will be

analyzed for any information regarding feelings of lostness or disorientation. Information

on which hierarchy was more usable from the user’s standpoint may shed light on which

hierarchy is preferred if any and why.

Conclusions

As demonstrated through the literature and analysis of the eBay site, e-commerce

does bring new challenges to the designer to implement traditional information retrieval

mechanisms in an online commercial setting. The use of the hierarchy to present options

to the potential buyer is still a solid decision because the hierarchical scheme is such a

familiar cognitive model for potential buyers. However, the traditional ways of creating

hierarchies must be re-examined with the user in mind. Categories of commerce do not

easily align themselves with the traditional educational and scientific models of creating

hierarchies. Instead, designers must be innovative and aware of their customer’s

expectations to design efficient hierarchies. User-testing and periodic data collection will

insure that the categories of the hierarchy accurately depict the information space, in this

case, the products available for sale.

With over $45 billion in online sales in 2002 (Business Journal Online, ), e-

commerce retailers must strive to make their site easy to use and to locate items if they

are to remain competitive. The five aspects of consumer behavior outlined by Miles and

Howes must be met using adaptations to the traditional ways of constructing hierarchies.

More studies need to be conducted on how online consumers browse in relation to their

traditional information seeking counterparts. A difference in search task and context of

search may lend itself to new design standards for online commerce sites to increase

usability and access for online buyers and sellers.

One additional area of study to pursue is the study of browsing itself.

Methodologies and evaluation tools need to be developed to evaluate user’s behaviors in

browsing situations. This presents many difficulties due to the nature of the task. Users

generally do not have a well-defined information need or a strongly-formulated idea of

what they are looking for. In a quantitative study, these types of motivations are difficult

to quantify due to the large influence of personal preferences, emotions, and motivations.

Equally troubling is the lack of a formulated query as noted by Dennis, Bruza, and

McArthur (2002.) Marchionini (1995) and Bates (1989) would argue that browsing is a

constant reformulation of a search query, however loosely understood by the searcher.

Regardless of which view one subscribes to, the environment is difficult to study in a

laboratory setting. The motivation for browsing in a laboratory setting may harder to

emulate than task-oriented searching because a browser generally will look for

information when he has time and the inclination to do so. In a task-oriented search,

participants have well-defined goals to reach and will try to reach that goal. When no

goals are given, some participants may flounder and not exhibit the behaviors the

investigators are trying to study.

All of these issues are critical concepts, worthy of further study due to the nature

of browsing and the growth of electronic commerce. Browsing is an everyday activity

that everyone participates in whether it be scanning documents for information, using the

yellow pages to look up a telephone number, or locate an item on an online auction.

Combined with a natural tendency to categorize items according to hierarchical

relationships, browsing is a behavior that information retrieval specialists as well as e-

commerce designers must understand in order to optimize the online experience for users.

Hierarchies and eBay

Documents