[go: up one dir, main page]

US20130144862A1 - Systems and Methods for Clustering User Reviews - Google Patents

Systems and Methods for Clustering User Reviews Download PDF

Info

Publication number
US20130144862A1
US20130144862A1 US13/754,646 US201313754646A US2013144862A1 US 20130144862 A1 US20130144862 A1 US 20130144862A1 US 201313754646 A US201313754646 A US 201313754646A US 2013144862 A1 US2013144862 A1 US 2013144862A1
Authority
US
United States
Prior art keywords
reviews
review
product
user reviews
electronic documents
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/754,646
Inventor
Jan Matthias Ruhl
Mayur D. Datar
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US13/754,646 priority Critical patent/US20130144862A1/en
Publication of US20130144862A1 publication Critical patent/US20130144862A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • G06F17/30864
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/954Navigation, e.g. using categorised browsing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0278Product appraisal
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0282Rating or review of business operators or products

Definitions

  • the disclosed embodiments relate generally to search engines. More particularly, the disclosed embodiments relate to methods, systems, and graphical user interfaces for finding, aggregating, and providing reviews for a product.
  • users may go to Web sites that provide reviews and ratings for products, such as www.amazon.com, www.epinions.com, and www.circuitcity.com.
  • a few sites such as www.rottentomatoes.com, movies.yahoo.com, and www.consumersearch.com, aggregate reviews for the same product from several sites.
  • some of these sites aggregate reviews manually, which is very inefficient and limits the number of reviews that can be gathered for each product.
  • all of these sites are very limited in the tools they provide users to navigate through the reviews.
  • a method comprising: at a computer system: extracting a plurality of user reviews from electronic documents, wherein the electronic documents contain user reviews of a plurality of items of interest; identifying a set of user reviews in the plurality of user reviews as being associated with the same item of interest in the plurality of items of interest, using item identifying information included in the electronic documents; and associating the set of user reviews with the same item of interest.
  • the item identifying information includes one of: a unique product identifier, brand name, model number, category information.
  • the item identifying information is extracted from metadata included in the electronic document.
  • the electronic documents are obtained from e-commerce websites or product-review websites.
  • the embodiments disclosed herein include new, more efficient ways to collect product reviews from the Internet, aggregate reviews for the same product, and provide an aggregated review to end users in a searchable format.
  • product refers to both products and services. Moreover, “product” encompasses virtually any product, service or combination thereof that can be bought, leased, rented, or similarly obtained. Exemplary products include, without limitation, consumer products, business products, movies, books, restaurants, hotels, and travel packages.
  • FIG. 1 is a block diagram illustrating an exemplary distributed computer system according to an embodiment of the present disclosure.
  • FIG. 2A is a block diagram illustrating an aggregated reviews backend server 102 in accordance with one embodiment of the present disclosure.
  • FIG. 2B is a block diagram illustrating an aggregated reviews frontend server 100 in accordance with one embodiment of the present disclosure.
  • FIG. 3 is a flowchart representing a method for collecting and aggregating reviews for a product according to one embodiment.
  • FIG. 4 is a flowchart representing a method for displaying an aggregated review for a product according to one embodiment.
  • FIG. 5 is a flowchart representing a method for searching aggregated reviews for a product according to one embodiment.
  • FIGS. 6A-6D are schematic screen shots of an exemplary graphical user interface of a product information module ( FIG. 6A ) and a product review presentation module ( FIGS. 6B-6D ) according to one embodiment.
  • FIG. 1 is a block diagram illustrating an exemplary distributed computer system according to an embodiment disclosed in the present disclosure.
  • This system includes client computer 104 , aggregated reviews backend server 102 , aggregated reviews frontend server 100 , multiple Web sites such as Web sites 108 and 110 , and communication network(s) 106 for interconnecting these components.
  • Client 104 includes graphical user interface (GUI) 112 .
  • Aggregated reviews backend server 102 collects product reviews from multiple Web sites 108 - 110 , identifies particular products that are associated with particular product reviews, generates aggregated review information for particular products, and stores the product reviews and the aggregated review information.
  • GUI graphical user interface
  • Aggregated reviews frontend server 100 receives and responds to requests from client 104 to provide an aggregated review for a product and/or to search within reviews for a particular product.
  • GUI 112 includes portions of a plurality of reviews for a product and a search input area for entering search terms to search for reviews of the product that contain the search terms.
  • FIG. 2A is a block diagram illustrating aggregated reviews backend server 102 in accordance with one embodiment of the present disclosure.
  • Server 102 typically includes one or more processing units (CPU's) 202 , one or more network or other communications interfaces 204 , memory 206 , and one or more communication buses 214 for interconnecting these components.
  • Server 102 optionally may include a user interface 208 comprising a display device 210 and a keyboard 212 .
  • Memory 206 may include high speed random access memory and may also include non-volatile memory, such as one or more magnetic disk storage devices.
  • Memory 206 may optionally include one or more storage devices remotely located from the CPU(s) 202 .
  • the memory 206 stores the following programs, modules and data structures, or a subset or superset thereof:
  • the reviews index 250 maps terms (e.g., words and phrases) to reviews. In some embodiments, the reviews index 250 also maps other values, such as ClusterIDs (i.e., product identifiers) or review author names, to the corresponding reviews.
  • ClusterIDs i.e., product identifiers
  • review author names i.e., review author names
  • FIG. 2B is a block diagram illustrating aggregated reviews frontend server 100 in accordance with one embodiment of the present disclosure. The aspects of FIG. 2B that are analogous to those in FIG. 2A are not described again here.
  • the memory 276 in frontend server 100 stores the following programs, modules and data structures, or a subset or superset thereof:
  • the reviews database 242 and reviews index 250 are generated by the backend server 102 , and are copied from the backend server to the frontend server 100 for responding to user requests for product review information.
  • each of the above identified modules and applications corresponds to a set of instructions for performing a function described above.
  • These modules i.e., sets of instructions
  • memory 206 or 276 may store a subset of the modules and data structures identified above.
  • memory 206 or 276 may store additional modules and data structures not described above.
  • FIGS. 2A and 2B show servers 100 and 102 as a number of discrete items
  • FIGS. 2A and 2B are intended more as functional descriptions of the various features which may be present in servers 100 and 102 rather than as a structural schematic of the embodiments described herein.
  • items shown separately could be combined and some items could be separated.
  • some items shown separately in FIGS. 2A and 2B could be implemented on single servers and single items could be implemented by one or more servers.
  • the actual number of servers used to implement backend server 102 and frontend server 100 and how features are allocated among them will vary from one implementation to another, and may depend in part on the amount of data traffic that the system must handle during peak usage periods as well as during average usage periods.
  • FIG. 3 is a flowchart representing a method for collecting and aggregating reviews for a product according to one embodiment.
  • the process shown in FIG. 3 is performed by aggregated reviews backend server 102 ( FIGS. 1 and 2A ). It will be appreciated by those of ordinary skill in the art that one or more of the acts described may be performed by hardware, software, or a combination thereof, as may be embodied in one or more computing systems.
  • Backend server 102 collects product reviews, or information containing product reviews for a plurality of products ( 302 ).
  • collecting information comprises selectively crawling review-related Web sites or portions thereof and retrieving information containing product reviews.
  • Selective crawling differs from traditional crawling in that the crawler only follows selected links on Web pages, rather than all links.
  • crawling module 220 To initiate a crawl, crawling module 220 is given a set of starting (sometimes called “seed”) URLs (Uniform Resource Locators, i.e., web addresses) to crawl.
  • seed Uniform Resource Locators, i.e., web addresses
  • Crawling module 220 puts these URLs in queue 222 .
  • Crawler 226 continuously fetches pages corresponding to the URLs in the queue 222 until either there more no more URLs in the queue 222 , or a crawl termination condition is reached.
  • link parser 224 tells crawler 226 how to extract new links to follow from a fetched page by matching regular expression patterns.
  • link parser 224 tells crawler 226 how to extract new links to follow from a fetched page by using text classification techniques.
  • Table 1 contains pseudo code for a link parser 224 that selects which links to follow on an exemplary website (www.productcompany.com) for reviews of electronics products.
  • Crawler 226 outputs the crawled pages to fetched pages buffer 228 .
  • the fetched pages buffer may contain one or more files or other data structures.
  • link parser 224 parses the page and attempts to extract links (URLs) to follow that, if found, are added to queue 222 .
  • URLs links
  • parser 224 may identify such links based on the presence of terms or patterns in the URLs of the links, or based on the anchor text of the links.
  • Anchor text is text contained in or otherwise associated with a link.
  • the information containing product reviews can be collected from a wide variety of sources, including web pages and forum postings.
  • the web pages comprise Web pages of review-related Web sites and/or product-related Web sites.
  • collecting information comprises receiving product reviews in a predetermined digital format, for example as data streams from multiple product reviewers.
  • the predetermined digital format of the product reviews may comprise key/value pairs, or data organized in predefined data structures.
  • collecting information comprises selecting information from a repository of information collected by a web crawler.
  • Backend server 102 automatically extracts product reviews from the collected information ( 304 ).
  • review parser 232 which is analogous to link parser 224 , extracts content pertaining to a review (e.g., the review text, author, and date), as well as content that identifies the product that the review is about (e.g. the product name, model number, or other features), from the fetched pages in buffer 228 .
  • the extracted information is stored in extracted reviews buffer 234 .
  • extracted reviews buffer 234 contains a list of key/value pairs denoting the extracted product review information, such as the review author, content, publisher, and so on.
  • each extracted review found on a web page is encoded and stored in a respective review record in the extracted reviews buffer 234 .
  • Table 2 lists the fields of each review record in the extracted reviews buffer 234 in an exemplary embodiment.
  • each review record may have a subset of the fields listed in Table 2, and may also include additional fields.
  • the review records might not include the ReviewType and CrawlDate fields.
  • the Rating field may be a numeric field, instead of a string field.
  • each review record may include both a string Rating field and a normalized numeric rating field (e.g., normalized with respect to a rating scale of 0 to 1, or 0 to 5).
  • the keyword “required” indicates a required field
  • “repeated” indicates a field that may occur more than once in a record (and in rare cases, may have zero occurrences)
  • “optional” indicates an optional field.
  • Optional fields may be present in some review records, while not present in other review records, for example because some reviews may not include the optional information.
  • the data structure fields shown in Table 2, except ReviewID, ReviewType, QualityScore and ClusterID, are filled with data from the extracted review information if information relevant to those fields are found in the extracted review information.
  • the ReviewID is a unique identifier assigned to each individual review by the aggregated reviews backend server.
  • the ReviewType indicates whether the review is an editorial review, or a review submitted by a user.
  • QualityScore which provides a measure of the quality of a review, is determined based on the review's length and word entropy (which measures, among other things, the diversity of the words or terms used in the review). As explained below, all reviews belonging to the same product form a “cluster” and the ClusterID identifies the cluster to which a review belongs.
  • Backend server 102 automatically identifies particular products that are associated with particular product reviews ( 306 ). For at least some of the extracted product reviews, backend server 102 identifies a particular product that is associated with the extracted product review.
  • identifying a particular product that is associated with the extracted product review comprises associating a unique number or string in the extracted product review with a particular product.
  • the unique number or string is a product model number.
  • the unique number is an ISBN number.
  • identifying a particular product that is associated with the extracted product review comprises associating a brand name and a model number in the extracted product review with a particular product. In some embodiments, identifying a particular product that is associated with the extracted product review comprises associating a brand name, a model number, and a product category in the extracted product review with a particular product.
  • identifying a particular product that is associated with the extracted product review is simple if the product review includes the global identifier.
  • extracted brand names and model numbers in the above example, the brand name is “canon”, and the model numbers are “lide30” and “7890a002”
  • category information such as the fact that the above product is a scanner
  • reviews about the same product form a “cluster” and “clustering” refers to identifying reviews for the same product.
  • this information is stored in review records (Table 2) by review extraction module 230 .
  • Extracted brand names, model numbers and category information may all be stored in the ProductNames fields of the review records.
  • each review record may include a Model field for storing a model number, a Category field for storing product category information, and a Names field for storing brand name information for a respective product.
  • Model, Category and Brand Name are not explicitly identified on a web page.
  • this information is deduced from the title string or other fields of the extracted review.
  • product category and model number can be inferred by the review parser 232 or the review aggregating module 236 as follows. Given a title string, the part of the title string beyond words like “for”, “with”, and “featuring” is ignored. Words in the title string are traversed from right to left and the first word (or pair of words) that is included in a data file containing a predefined list of product categories is considered the product category.
  • this data file also identifies, for each distinct product category, a list of other product category names for the product category that are mapped to one canonical name. For example, “notebooks”, “notebook”, “laptop” or “laptops” may all be mapped to the canonical name, “notebooks”. Thus, a title string such as “Canon canoscan lide 30 color scanner with free extras” would be mapped to the category “scanner”.
  • model numbers the title string is traversed from left to right and the first word that contains digits (i.e., characters from ‘0’-‘9’) is concatenated with the preceding word to form a model number.
  • the resulting model number is discarded if it matches any of the regular expression patterns corresponding to blacklisted model numbers or contains a brand name or parts of a brand name.
  • An example of a blacklisted pattern is “pentium ⁇ d+” +” (i.e., “pentium” followed by a string of digits).
  • “lide 30” is extracted as the model number. Note that more than one model number can be inferred for a product name.
  • the brand names are identified by matching words in the product title with known (i.e., predefined) brand names in a list or dictionary of such brand names.
  • the brand name of a product may be extracted from the URL of the page on which the product review is found.
  • two product reviews are assigned to the same cluster (product) if they have the same exact brand name, and agree on their model numbers and categories.
  • Two reviews are said to agree on a model number if the set of model numbers extracted for them have a non-empty intersection.
  • Two categories are said to agree if they are the same or at least one of them is unknown.
  • two product reviews are assigned to the same cluster (product) if they have the same exact brand name and agree on their model numbers.
  • reviews corresponding to products for which brand or model information cannot be determined are put into a singleton cluster of their own and are not clustered with any other reviews.
  • review aggregating module 236 fills in the ClusterID field in the review record for the extracted review.
  • backend server 102 For each particular product in at least a subset of the plurality of products, backend server 102 generates aggregated review information for the particular product based on a plurality of extracted product reviews that are associated with the particular product ( 308 ).
  • review aggregating module 236 uses the reviews for a particular product to generate aggregated review information for the product that is stored in aggregated review buffer 240 .
  • the aggregated review information for the particular product comprises a total number of reviews for the product, an average rating for the product, a distribution of the ratings for the product, and/or frequently appearing phrases in the extracted product reviews associated with the product.
  • One or more additional types of aggregated review information may be included in the aggregated review information in other embodiments.
  • the average rating is the average of weighted ratings from different web sites.
  • the weight of a single review on a web site with K reviews for the same product is 1 /K.
  • the weight given to reviews from a particular web site or reviewer can be personalized by a user. For example, a user can specify via a preferences dialog box one or more web sites (e.g., www.consumerreports.org) and/or reviewers (e.g., Roger Ebert) preferred by the user.
  • the ratings given in the reviews associated with the user's preferred sources are then given higher weight when computing an average rating for a product.
  • the ratings given in reviews associated with the user's preferred sources are given N times the default weight assigned to those reviews, where N is a value such as 2, 2.5, 3 or any other appropriate value greater than 1.
  • An exemplary representation of the average rating computation is:
  • a i represents the weight given to each rating, rating i , of each respective review i.
  • frequently appearing phrases in the extracted product reviews associated with a particular product are computed by review aggregation module 236 as follows. For all the phrases (e.g., unigrams, bigrams or trigrams that do not span across sentences) that occur in reviews associated with a product (cluster), the number of their weighted occurrences is counted. Occurrences in the review title or summary are given a higher weight. When computing or selecting frequently appearing phrases, all stop words and words that occur in the title of the product name are ignored. Each phrase is given a score equal to the length of the phrase times the square root of the sum of weighted occurrences of the phrase. A predetermined number of phrases with the highest scores (e.g., the top 6, 7, 8, 9 or 10 phrases) that do not overlap each other are chosen as the frequent phrases and stored in aggregated review buffer 240 .
  • scores e.g., the top 6, 7, 8, 9 or 10 phrases
  • Table 3 lists the fields of each aggregated review record, produced by aggregating information from all the extracted reviews for a particular product, and stored in the aggregated reviews buffer 240 .
  • each aggregated review record may have a subset of the fields listed in Table 3, and may also include additional fields.
  • a separate instance of the aggregated review record is included in the aggregated reviews buffer 240 and in the reviews database 242 for each distinct product.
  • each aggregated review record includes a ClusterID (identifying the product associated with this record), a list of ReviewID's (each of which is uniquely associated with a review of the product), a list of the words or phrases most frequently found in the reviews of the product, and a computed average rating of all the reviews for the product.
  • an aggregated review record also includes one or more of the following additional fields: a string identifying the product, a number specifying the number of reviews in the cluster; and data concerning the distribution of ratings in the reviews for a product, such as the number of reviews with low, medium and high ratings, as well as the number of reviews having no product rating.
  • Backend server 102 automatically stores the extracted product reviews 244 and the aggregated review information 246 in reviews database 242 ( 310 ).
  • each stored product review in the reviews database 242 comprises a record that includes the same information as the corresponding review record (Table 2) stored in the extracted reviews buffer 234 , while in other embodiments, the stored product review may comprise a subset of the information stored in the corresponding review record.
  • reviews indexer 248 indexes the individual product reviews 244 and the aggregated review information for a product 246 and stores the results in reviews index 250 .
  • Client 104 sends a request for an aggregated review for a product ( 402 , FIG. 4 ).
  • the request is sent in response to a user-initiated activity, such as a user clicking on a hyperlink.
  • a user looking at web page 650 with various products for sale could activate hyperlink 652 to see an aggregated review for product 654 .
  • frontend server 100 receives the request ( 404 ) for an aggregated review for a product and obtains ( 406 ) the aggregated review.
  • product information 254 in product database 252 is linked to aggregated reviews for the corresponding products by product-to-review map 256 .
  • map 256 contains mappings of product identifiers used by a first website or service (e.g., a product information website) to the ClusterIDs used by the frontend server 100 .
  • the process of assigning product identifiers to products in the first website or service may be analogous to the process described above for identifying particular products that are associated with particular product reviews.
  • map 256 is determined by matching a global identifier for a particular product (e.g., an ISBN number) in product database 252 with a corresponding review cluster.
  • Product-to-review map 256 is used to identify the aggregated review (in the reviews database 242 ) corresponding to a product.
  • Presentation module 264 formats the aggregated review.
  • the reviews index 250 is used to identify the individual reviews corresponding to the aggregated review for a product. More specifically, the reviews index 250 maps the ClusterID of the aggregated review to the individual reviews for the product associated with the aggregated review.
  • Frontend server 100 sends ( 408 ) the aggregated review to client computer 104 via communications network 106 .
  • Client 104 receives ( 410 ) and displays ( 412 ) the aggregated review 600 ( FIG. 6B ).
  • aggregated review 600 includes portions of reviews 626 for the product and a search input area 618 for entering search terms to search for reviews of the product that contain the search terms. Search input area 618 is used to enter search terms for a search that requests only results that are within the reviews for the product.
  • aggregated review 600 includes a histogram 608 of review ratings for the product, an aggregate rating 604 for the product, and/or the number 606 of reviews for the product. In some embodiments, aggregated review 600 is displayed in a single Web page.
  • review search module 258 supports two different types of queries: (1) search within the reviews 262 belonging to a single product (e.g. a search for “customer service” in the reviews for a given product) and (2) search all reviews 260 to find a particular product (e.g. a search for “good digital camera”).
  • Aggregated review 600 FIG. 6B , shows a search input area 618 for the first type of search.
  • a variety of user initiated activities can start the search process, including the user entering search terms in search input area 618 , selecting a hyperlink for a frequently appearing phrase 620 , selecting a hyperlink to a review source 622 , or selecting a hyperlinked portion (e.g., 610 , 612 , 614 , or 616 ) of histogram 608 .
  • client 104 sends ( 502 , FIG. 5 ) a request to search within a plurality of reviews for a particular product.
  • frontend server 100 receives ( 504 ) the search request from client 104 and obtains ( 506 ) information satisfying the request.
  • the information is obtained by review search module 258 sending two queries to reviews database 242 .
  • the first query all individual reviews (sometimes called “documents”) that contain the search terms are found.
  • the first query is restricted to the main body (e.g., the “review content” fields) of the reviews index 250 , and only to individual reviews that belong to the product (cluster).
  • the query “customer service” for reviews in cluster 1234 would be translated to the following:
  • ClusterID 1234, body:(customer service).
  • a second query is then issued to obtain portions of all reviews that are to be displayed.
  • the ordering preferences submitted by the user e.g., by clicking on 636 , 638 , 640 , and/or 642 ) as well as the start position are used to determine which reviews to display.
  • the start position indicates the ordinal number within an ordered set of reviews at which to begin formatting reviews for display.
  • Frontend server 100 sends ( 508 , FIG. 5 ) at least a portion of the obtained information to client 104 .
  • Client 104 receives ( 510 ) information satisfying the request and displays ( 512 ) the received information, e.g. in web page 660 , FIG. 6C .
  • Client 104 displays at least portions 662 of reviews of the product matching the search terms 664 .
  • the information sent by frontend server 100 and received by client 104 includes at least portions 662 of reviews for the particular product that satisfy the search request. In some embodiments, this information includes the identities 666 of the review sources contributing to the sent information.
  • the information is also obtained by review search module 258 sending two queries to reviews database 242 .
  • first query all individual reviews that contain the search terms are found, without restricting the search to a particular product (cluster).
  • the results of the first query are grouped by cluster (product).
  • a second query is then issued to obtain portions of at most three (or some other small number) reviews in each cluster, which are restricted to be from different publishers.
  • information about each of the clusters such as the total number of reviews and the average rating, is also requested, obtained, and sent for display.
  • the graphical user interface 600 ( FIG. 6B ) on client computer 104 comprises a plurality of portions 626 of reviews for a product and a search input area 618 for entering search terms to search for reviews of the product that contain the search terms.
  • the plurality of portions of reviews can be grouped by review source 636 , sorted by date 642 , and sorted by relevance to the search input 640 , or sorted by quality of review when the user has not provided a search input.
  • more or fewer sorting and grouping options may be provided for sorting and/or grouping the reviews.
  • graphical user interface 600 also includes, either alone or in various combinations: an aggregate rating 604 for the product; a list of suggested search terms 620 ; a list of sources 622 for the reviews that link back to the source Web sites or to the corresponding reviews; at least one user selectable link for sorting the portions of reviews by date 642 or relevance to the search input 640 , and/or for grouping the portions of reviews by source 636 ; and/or a histogram 608 of reviews for the product.
  • clicking on a region e.g., 610 , 612 , 614 , or 616
  • clicking on region 616 may generate a new display showing portions of reviews having high ratings for the product.
  • An aggregated rating 604 is given to the product by calculating an average over all reviews for the product.
  • the ratings in the individual reviews 244 and in the aggregated reviews 246 are normalized, e.g., to a 0 - 1 or 0 - 5 scale.
  • the aggregated rating 604 and number 606 of reviews are changed to reflect the average rating and number of reviews matching the search. This change in the aggregated rating can help users answer questions like, “How do reviewers that discuss ‘customer service’ rate this product?”
  • the aggregated rating 604 and number 606 of reviews are not changed to reflect the average rating and number of reviews matching the search.
  • Commonly used phrases e.g., unigrams, bigrams or trigrams appearing in the reviews for the product are displayed as a list of suggested search terms 620 . Phrases that contain the product name and stop-words (such as “the”) are removed to improve the quality of the suggested search terms. Clicking on one of the phrases results in displaying just the reviews that contain the user selected phrase. This is equivalent to entering the phrase into the search box 618 with quotes around the phrase and then initiating a search. Showing these phrases gives the user a good idea which aspects of the product are most frequently discussed in the product reviews, and thus which aspects of the product might be important to know about.
  • the list of review sources 622 includes links to the sites from which reviews for the product have been obtained, or a subset thereof. In some embodiments, after a search within reviews for a product, the list of review sources 622 just includes links to the sites from which reviews matching the search have been obtained, or a subset thereof. In some embodiments, the links lead to the main pages for the given product on each source web site. In some embodiments, the links lead to the product reviews matching the search.
  • a histogram 608 with a rating distribution of the reviews includes the number of reviews that are unrated 610 (in some embodiments, displayed as a gray area), negative 612 (in some embodiments, displayed as a red area), neutral 614 (in some embodiments, displayed as a yellow area) and positive 616 (in some embodiments, displayed as a green area).
  • the rating distribution is displayed as a flat histogram, where the length of each colored bar is proportional to the number of reviews in the category. Clicking on a histogram results in a search 262 that just shows the corresponding reviews, e.g. clicking on the red bar 612 results in showing just the negative reviews.
  • Search input area 618 allows a user to enter arbitrary search terms for the search within reviews for a given product. As shown in FIG. 6C , in some embodiments the search terms are highlighted in portions of reviews 626 that match the search query.
  • User selectable links permit the displayed list of reviews to be sorted in various ways, such as by:
  • a portion 626 of the review e.g., a snippet
  • the portion or snippet 626 of each review may be generated using the same or similar techniques to those used to generate snippets of search results in an Internet search engine such as the Google search engine.
  • the portion of each review shown may be a portion matching one or more search terms that were used to select the reviews to be displayed.
  • the portion of each review shown may be a beginning portion of the review.
  • the portion of each review shown may be a portion determined by an automatic process to be a summary or most important portion of the review.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Systems and methods for clustering user reviews are disclosed in which a plurality of user reviews are extracted from electronic documents. The electronic documents contain user reviews of a plurality of items of interest. A set of user reviews is identified in the plurality of user reviews as being associated with the same item of interest in the plurality of items of interest. Item identifying information included in the electronic documents is used for this identification. The set of user reviews is then associated with the same item of interest. Examples of item identifying information include unique product identifiers, brand names, model numbers, and category information. In some instances, the item identifying information is extracted from metadata included in the electronic document. In some instances, the electronic documents are obtained from e-commerce websites or product-review websites.

Description

    RELATED APPLICATIONS
  • This application is a continuation of U.S. patent application Ser. No. 13/407,650, filed on Feb. 28, 2012, now U.S. Pat. No. ______, which is a continuation of U.S. patent application Ser. No. 13/102,023, filed on May 5, 2011, now U.S. Pat. No. 8,161,030, which is a continuation of U.S. patent application Ser. No. 11/012,500, filed on Dec. 14, 2004, now U.S. Pat. No. 7,962,461. All the above-identified patents and patent applications are hereby incorporated by reference herein in their entireties.
  • TECHNICAL FIELD
  • The disclosed embodiments relate generally to search engines. More particularly, the disclosed embodiments relate to methods, systems, and graphical user interfaces for finding, aggregating, and providing reviews for a product.
  • BACKGROUND
  • Many Internet users research a product (e.g., a Canon digital camera) or a service (e.g., a hotel room in Hawaii) before obtaining it. Currently, the approach that many users follow is to use Internet search engines. Users issue a search query that contains the product or service name plus terms like “review” and sift through the myriad of results that are returned by the search engine. While some of these results do contain reviews about the product, many results do not. Moreover, it is very difficult to get a holistic view of all the reviews. In short, this process is fairly cumbersome, time consuming, and inefficient.
  • Alternatively, users may go to Web sites that provide reviews and ratings for products, such as www.amazon.com, www.epinions.com, and www.circuitcity.com. A few sites, such as www.rottentomatoes.com, movies.yahoo.com, and www.consumersearch.com, aggregate reviews for the same product from several sites. However, some of these sites aggregate reviews manually, which is very inefficient and limits the number of reviews that can be gathered for each product. Moreover, all of these sites are very limited in the tools they provide users to navigate through the reviews.
  • Thus, it would be highly desirable to provide tools that enable users to more efficiently conduct research on the products and services they are interested in obtaining (e.g., by purchase, lease, rental, or other similar transaction).
  • SUMMARY
  • Systems and methods for clustering user reviews are disclosed. In some implementations, a method comprising: at a computer system: extracting a plurality of user reviews from electronic documents, wherein the electronic documents contain user reviews of a plurality of items of interest; identifying a set of user reviews in the plurality of user reviews as being associated with the same item of interest in the plurality of items of interest, using item identifying information included in the electronic documents; and associating the set of user reviews with the same item of interest.
  • In some implementations, the item identifying information includes one of: a unique product identifier, brand name, model number, category information. In some implementations, the item identifying information is extracted from metadata included in the electronic document. In some implementations, the electronic documents are obtained from e-commerce websites or product-review websites.
  • The embodiments disclosed herein include new, more efficient ways to collect product reviews from the Internet, aggregate reviews for the same product, and provide an aggregated review to end users in a searchable format.
  • As used in the specification and claims, the word “product” refers to both products and services. Moreover, “product” encompasses virtually any product, service or combination thereof that can be bought, leased, rented, or similarly obtained. Exemplary products include, without limitation, consumer products, business products, movies, books, restaurants, hotels, and travel packages.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • For a better understanding of the aforementioned aspects of the disclosed embodiments as well as additional aspects and embodiments thereof, reference should be made to the Description of Embodiments below, in conjunction with the following drawings in which like reference numerals refer to corresponding parts throughout the figures.
  • FIG. 1 is a block diagram illustrating an exemplary distributed computer system according to an embodiment of the present disclosure.
  • FIG. 2A is a block diagram illustrating an aggregated reviews backend server 102 in accordance with one embodiment of the present disclosure.
  • FIG. 2B is a block diagram illustrating an aggregated reviews frontend server 100 in accordance with one embodiment of the present disclosure.
  • FIG. 3 is a flowchart representing a method for collecting and aggregating reviews for a product according to one embodiment.
  • FIG. 4 is a flowchart representing a method for displaying an aggregated review for a product according to one embodiment.
  • FIG. 5 is a flowchart representing a method for searching aggregated reviews for a product according to one embodiment.
  • FIGS. 6A-6D are schematic screen shots of an exemplary graphical user interface of a product information module (FIG. 6A) and a product review presentation module (FIGS. 6B-6D) according to one embodiment.
  • DESCRIPTION OF EMBODIMENTS
  • Methods, systems, and graphical user interfaces are described that show how to find, aggregate, and provide reviews for a product. Reference will be made to certain embodiments of the present disclosure, examples of which are illustrated in the accompanying drawings. While the present disclosure will be described in conjunction with the embodiments, it will be understood that it is not intended to limit the present disclosure to these particular embodiments alone. On the contrary, the present disclosure is intended to cover alternatives, modifications and equivalents that are within the spirit and scope of the present disclosure as defined by the appended claims.
  • Moreover, in the following description, numerous specific details are set forth to provide a thorough understanding of the present disclosure. However, it will be apparent to one of ordinary skill in the art that the embodiments disclosed in the present disclosure may be practiced without these particular details. In other instances, methods, procedures, components, and networks that are well-known to those of ordinary skill in the art are not described in detail to avoid obscuring aspects of the present disclosure.
  • FIG. 1 is a block diagram illustrating an exemplary distributed computer system according to an embodiment disclosed in the present disclosure. This system includes client computer 104, aggregated reviews backend server 102, aggregated reviews frontend server 100, multiple Web sites such as Web sites 108 and 110, and communication network(s) 106 for interconnecting these components. Client 104 includes graphical user interface (GUI) 112. Aggregated reviews backend server 102 collects product reviews from multiple Web sites 108-110, identifies particular products that are associated with particular product reviews, generates aggregated review information for particular products, and stores the product reviews and the aggregated review information. Aggregated reviews frontend server 100 receives and responds to requests from client 104 to provide an aggregated review for a product and/or to search within reviews for a particular product. GUI 112 includes portions of a plurality of reviews for a product and a search input area for entering search terms to search for reviews of the product that contain the search terms.
  • FIG. 2A is a block diagram illustrating aggregated reviews backend server 102 in accordance with one embodiment of the present disclosure. Server 102 typically includes one or more processing units (CPU's) 202, one or more network or other communications interfaces 204, memory 206, and one or more communication buses 214 for interconnecting these components. Server 102 optionally may include a user interface 208 comprising a display device 210 and a keyboard 212. Memory 206 may include high speed random access memory and may also include non-volatile memory, such as one or more magnetic disk storage devices. Memory 206 may optionally include one or more storage devices remotely located from the CPU(s) 202. In some embodiments, the memory 206 stores the following programs, modules and data structures, or a subset or superset thereof:
      • an operating system 216 that includes procedures for handling various basic system services and for performing hardware dependent tasks;
      • a communication module 218 that is used for connecting server 102 to other computers (e.g., Web sites 108 and 110) via one or more communication interfaces 204 (wired or wireless), which in turn connect the backend server 102 to one or more communications networks such as the Internet, other wide area networks, local area networks, metropolitan area networks, and so on;
      • a crawling module 220 for crawling Web sites (e.g., 108 and 110), which includes
        • a URL queue 222 that stores URLs to be crawled by crawler 226;
        • a link parser 224 that tells crawler 226 how to extract new links to follow from a fetched page;
        • a crawler 226 that fetches web pages, and
        • a fetched pages buffer 228 that temporarily stores web pages fetched by crawler 226;
      • a review extraction module 230 for extracting product reviews from the information collected by crawling module 220, which includes a review parser 232 that extracts content pertaining to a review (e.g., the review text, author, and date), as well as content that identifies the product that the review is about (e.g. the product name, model number, and/or other product identifiers), from the fetched pages in buffer 228;
      • an extracted reviews buffer 234 that temporarily stores the reviews extracted by review parser 232;
      • a review aggregating module 236 that identifies particular products that are associated with particular product reviews and generates aggregated review information for particular products;
      • an aggregated review buffer 240 that stores aggregated review information for a product, such as the total number of reviews for the product, an average rating for the product, a distribution of ratings, and/or frequently appearing phrases in the extracted product reviews associated with the product; one or more additional types of aggregated review information may be stored in the aggregated review buffer 240 in other embodiments;
      • a review database 242 that stores individual reviews 244 and aggregated reviews 246;
      • a review indexer 248 that indexes the reviews (244 and 246) stored in the review database 242 so as produce a reviews index 250; and
      • the reviews index 250, which stores the index created by review indexer 248.
  • The reviews index 250 maps terms (e.g., words and phrases) to reviews. In some embodiments, the reviews index 250 also maps other values, such as ClusterIDs (i.e., product identifiers) or review author names, to the corresponding reviews.
  • FIG. 2B is a block diagram illustrating aggregated reviews frontend server 100 in accordance with one embodiment of the present disclosure. The aspects of FIG. 2B that are analogous to those in FIG. 2A are not described again here. The memory 276 in frontend server 100 stores the following programs, modules and data structures, or a subset or superset thereof:
      • an operating system 216, as described above;
      • a communication module 218, as described above;
      • a product database 252 for storing product-related information, which includes
        • product and vendor information 254 such as product names, models, categories, and purchasing information; and
        • a product-to-review map 256 that associates products in database 252 with their corresponding ClusterIDs (i.e., product identifiers); the ClusterIDs are mapped by the reviews index 250 to individual reviews 244 and aggregated reviews 246;
      • reviews index 250, as described above;
      • a review search module 258 that answers search requests, which includes
        • a “search all reviews” application 260 for searching all reviews in reviews database 242; and
        • a “search within reviews for a product” application 262 for searching within the set of reviews for a particular product, to identify those reviews containing one or more search terms; and
      • a presentation module 264 for formatting aggregated reviews and search results for display, including
        • a reviews formatter 266 for formatting individual reviews and aggregated review information for display; and
        • a reviews sorter 268 for sorting the reviews for a particular product in any of the following ways, or a subset or superset thereof: by date, review source (e.g., by website), review quality, or review relevance to one or more search terms.
  • The reviews database 242 and reviews index 250 are generated by the backend server 102, and are copied from the backend server to the frontend server 100 for responding to user requests for product review information.
  • Referring to FIGS. 2A and 2B, each of the above identified modules and applications corresponds to a set of instructions for performing a function described above. These modules (i.e., sets of instructions) need not be implemented as separate software programs, procedures or modules, and thus various subsets of these modules may be combined or otherwise re-arranged in various embodiments. In some embodiments, memory 206 or 276 may store a subset of the modules and data structures identified above. Furthermore, memory 206 or 276 may store additional modules and data structures not described above.
  • Although FIGS. 2A and 2B show servers 100 and 102 as a number of discrete items, FIGS. 2A and 2B are intended more as functional descriptions of the various features which may be present in servers 100 and 102 rather than as a structural schematic of the embodiments described herein. In practice, and as recognized by those of ordinary skill in the art, items shown separately could be combined and some items could be separated. For example, some items shown separately in FIGS. 2A and 2B could be implemented on single servers and single items could be implemented by one or more servers. The actual number of servers used to implement backend server 102 and frontend server 100 and how features are allocated among them will vary from one implementation to another, and may depend in part on the amount of data traffic that the system must handle during peak usage periods as well as during average usage periods.
  • Collecting Product Reviews
  • FIG. 3 is a flowchart representing a method for collecting and aggregating reviews for a product according to one embodiment. The process shown in FIG. 3 is performed by aggregated reviews backend server 102 (FIGS. 1 and 2A). It will be appreciated by those of ordinary skill in the art that one or more of the acts described may be performed by hardware, software, or a combination thereof, as may be embodied in one or more computing systems.
  • Backend server 102 collects product reviews, or information containing product reviews for a plurality of products (302).
  • In some embodiments, collecting information comprises selectively crawling review-related Web sites or portions thereof and retrieving information containing product reviews. Selective crawling differs from traditional crawling in that the crawler only follows selected links on Web pages, rather than all links.
  • To initiate a crawl, crawling module 220 is given a set of starting (sometimes called “seed”) URLs (Uniform Resource Locators, i.e., web addresses) to crawl.
  • Crawling module 220 puts these URLs in queue 222. Crawler 226 continuously fetches pages corresponding to the URLs in the queue 222 until either there more no more URLs in the queue 222, or a crawl termination condition is reached. In some embodiments, link parser 224 tells crawler 226 how to extract new links to follow from a fetched page by matching regular expression patterns. In some embodiments, link parser 224 tells crawler 226 how to extract new links to follow from a fetched page by using text classification techniques. As an example of the former approach, Table 1 contains pseudo code for a link parser 224 that selects which links to follow on an exemplary website (www.productcompany.com) for reviews of electronics products.
  • TABLE 1
    Exemplary Pseudo Code for Crawling Module with Link Parser
    Initialize crawler queue with seed page(s) from the www.productcompany.com website;
    For each URL in the queue, do the following {
    download page from URL, and remove the URL from the queue;
    extract product reviews, if any, on the page, and parse and store the extracted product
    reviews;
    search page to see if any portion of the page matches any predefined pattern in a set of
    predefined patterns (contentpattern1, contentpattern2, ...);
    if the page does not match any of the one or more predefined patterns {
    processing of the page is complete;}
    // otherwise, continue processing the page
    collect all links on the page;
    process each link as follows {
    if the URL of the link matches any predefined pattern in a set of predefined
    patterns (linkpattern1, linkpattern2, ...) OR
    if the anchor text of the link matches any predefined pattern in a set of
    predefined patterns (anchorpattern1, anchorpattern2, ...), do the following: {
    optional: remove unneeded fields from the URL of the link;
    put the URL of the link on the crawler queue;}
    } // Links that do not match any of the predefined pattern(s) are not put on the queue
    // The patterns used in the processing of the crawled pages may be specific to the
    // website from which product review information is being extracted.
  • Crawler 226 outputs the crawled pages to fetched pages buffer 228. In some embodiments, the fetched pages buffer may contain one or more files or other data structures. For each page that is fetched, link parser 224 parses the page and attempts to extract links (URLs) to follow that, if found, are added to queue 222. As noted above, in contrast to traditional crawls, not all the links in a page are followed by the crawler. Rather, only those links that parser 224 identifies as potentially leading to review pages are followed. For instance, parser 224 may identify such links based on the presence of terms or patterns in the URLs of the links, or based on the anchor text of the links. Anchor text is text contained in or otherwise associated with a link.
  • The information containing product reviews can be collected from a wide variety of sources, including web pages and forum postings. In some embodiments, the web pages comprise Web pages of review-related Web sites and/or product-related Web sites.
  • In some embodiments, collecting information comprises receiving product reviews in a predetermined digital format, for example as data streams from multiple product reviewers. The predetermined digital format of the product reviews may comprise key/value pairs, or data organized in predefined data structures. In some embodiments, collecting information comprises selecting information from a repository of information collected by a web crawler.
  • Backend server 102 automatically extracts product reviews from the collected information (304). In particular, review parser 232, which is analogous to link parser 224, extracts content pertaining to a review (e.g., the review text, author, and date), as well as content that identifies the product that the review is about (e.g. the product name, model number, or other features), from the fetched pages in buffer 228. The extracted information is stored in extracted reviews buffer 234. In some embodiments, extracted reviews buffer 234 contains a list of key/value pairs denoting the extracted product review information, such as the review author, content, publisher, and so on.
  • The information extracted from a single web page may correspond to multiple reviews. In some embodiments, each extracted review found on a web page is encoded and stored in a respective review record in the extracted reviews buffer 234. Table 2 lists the fields of each review record in the extracted reviews buffer 234 in an exemplary embodiment. In other embodiments, each review record may have a subset of the fields listed in Table 2, and may also include additional fields. For example, in some embodiments, the review records might not include the ReviewType and CrawlDate fields. In some embodiments, the Rating field may be a numeric field, instead of a string field. Alternately, in some embodiment, each review record may include both a string Rating field and a normalized numeric rating field (e.g., normalized with respect to a rating scale of 0 to 1, or 0 to 5).
  • In Table 2, the keyword “required” indicates a required field, “repeated” indicates a field that may occur more than once in a record (and in rare cases, may have zero occurrences), and “optional” indicates an optional field. Optional fields may be present in some review records, while not present in other review records, for example because some reviews may not include the optional information.
  • TABLE 2
    Exemplary Fields for Individual Review Records in
    Individual Reviews Buffer and Reviews Database
    required int64 ReviewID // unique identifier for each review
    repeated string ProductNames // one or more product names associated
    with the review
    required int32 ReviewType // exemplary values: editorial = 1, user = 2
    required string Rating // product rating given by the reviewer
    optional string Author // author of the review
    optional string Title // title of the review
    optional string Summary // review summary
    optional string Content // review content, which is the main body
    of the review
    optional string Date // Review date
    required int32 CrawlDate // crawl date
    required string URL // URL at which review was found.
    // Alternately, this may be the base URL
    for product
    // reviews at the website at which this
    review was found
    required string Publisher // publisher of the review (e.g., CNET,
    epinions.com)
    // Additional Optional Fields that may be included in some embodiments:
    optional int64 ClusterID // ClusterID of the product
    optional float QualityScore // Quality Score of the review
    optional int32 Year // Model year, or other year associated
    with the product
    // (e.g., year of first publication of a book
    or a movie)
  • The data structure fields shown in Table 2, except ReviewID, ReviewType, QualityScore and ClusterID, are filled with data from the extracted review information if information relevant to those fields are found in the extracted review information. The ReviewID is a unique identifier assigned to each individual review by the aggregated reviews backend server. The ReviewType indicates whether the review is an editorial review, or a review submitted by a user. QualityScore, which provides a measure of the quality of a review, is determined based on the review's length and word entropy (which measures, among other things, the diversity of the words or terms used in the review). As explained below, all reviews belonging to the same product form a “cluster” and the ClusterID identifies the cluster to which a review belongs.
  • Clustering Reviews (Identifying Reviews for the Same Product)
  • Backend server 102 automatically identifies particular products that are associated with particular product reviews (306). For at least some of the extracted product reviews, backend server 102 identifies a particular product that is associated with the extracted product review.
  • In some embodiments, identifying a particular product that is associated with the extracted product review comprises associating a unique number or string in the extracted product review with a particular product. In some embodiments, the unique number or string is a product model number. In some embodiments, the unique number is an ISBN number.
  • In some embodiments, identifying a particular product that is associated with the extracted product review comprises associating a brand name and a model number in the extracted product review with a particular product. In some embodiments, identifying a particular product that is associated with the extracted product review comprises associating a brand name, a model number, and a product category in the extracted product review with a particular product.
  • Different web sites often use different names for the same product, which makes it difficult to automatically determine whether two reviews actually refer to the same product. For example, different web sites refer to the same Canon scanner as:
      • Canon CanoScan 7890a002 Flatbed 7890a002
      • Canon CanoScan LiDE 30
      • Canon CanoScan LiDE 30 Scanner
      • Canon CanoScan LiDE 30 Color Scanner
      • Canon Lide 30 (7890A002)
  • For products with global identifiers (such as ISBN numbers for books), identifying a particular product that is associated with the extracted product review is simple if the product review includes the global identifier. In some embodiments, in the absence of such a global identifier in the review, extracted brand names and model numbers (in the above example, the brand name is “canon”, and the model numbers are “lide30” and “7890a002”), as well as category information (such as the fact that the above product is a scanner) are used to identify reviews that refer to the same product. As used herein, reviews about the same product form a “cluster” and “clustering” refers to identifying reviews for the same product. As described above, in some embodiments, this information is stored in review records (Table 2) by review extraction module 230. Extracted brand names, model numbers and category information may all be stored in the ProductNames fields of the review records. Alternately, in some embodiments each review record may include a Model field for storing a model number, a Category field for storing product category information, and a Names field for storing brand name information for a respective product.
  • Sometimes, one or more of these three pieces of information (Model, Category and Brand Name) are not explicitly identified on a web page. In such cases, this information is deduced from the title string or other fields of the extracted review. For example, product category and model number can be inferred by the review parser 232 or the review aggregating module 236 as follows. Given a title string, the part of the title string beyond words like “for”, “with”, and “featuring” is ignored. Words in the title string are traversed from right to left and the first word (or pair of words) that is included in a data file containing a predefined list of product categories is considered the product category. In some embodiments, this data file also identifies, for each distinct product category, a list of other product category names for the product category that are mapped to one canonical name. For example, “notebooks”, “notebook”, “laptop” or “laptops” may all be mapped to the canonical name, “notebooks”. Thus, a title string such as “Canon canoscan lide 30 color scanner with free extras” would be mapped to the category “scanner”.
  • To extract model numbers, the title string is traversed from left to right and the first word that contains digits (i.e., characters from ‘0’-‘9’) is concatenated with the preceding word to form a model number. In some embodiments, the resulting model number is discarded if it matches any of the regular expression patterns corresponding to blacklisted model numbers or contains a brand name or parts of a brand name. An example of a blacklisted pattern is “pentium \d+” +” (i.e., “pentium” followed by a string of digits). For the title string in the previous example, “lide 30” is extracted as the model number. Note that more than one model number can be inferred for a product name.
  • In some embodiments, the brand names are identified by matching words in the product title with known (i.e., predefined) brand names in a list or dictionary of such brand names. In some embodiments, the brand name of a product may be extracted from the URL of the page on which the product review is found.
  • In some embodiments, two product reviews are assigned to the same cluster (product) if they have the same exact brand name, and agree on their model numbers and categories. Two reviews are said to agree on a model number if the set of model numbers extracted for them have a non-empty intersection. Two categories are said to agree if they are the same or at least one of them is unknown.
  • In some embodiments, two product reviews are assigned to the same cluster (product) if they have the same exact brand name and agree on their model numbers.
  • In some embodiments, reviews corresponding to products for which brand or model information cannot be determined are put into a singleton cluster of their own and are not clustered with any other reviews.
  • In some embodiments, when a product review is assigned to a cluster, review aggregating module 236 fills in the ClusterID field in the review record for the extracted review.
  • Generating Aggregate Information
  • For each particular product in at least a subset of the plurality of products, backend server 102 generates aggregated review information for the particular product based on a plurality of extracted product reviews that are associated with the particular product (308). In some embodiments, review aggregating module 236 uses the reviews for a particular product to generate aggregated review information for the product that is stored in aggregated review buffer 240.
  • In some embodiments, the aggregated review information for the particular product comprises a total number of reviews for the product, an average rating for the product, a distribution of the ratings for the product, and/or frequently appearing phrases in the extracted product reviews associated with the product. One or more additional types of aggregated review information may be included in the aggregated review information in other embodiments.
  • In some embodiments, the average rating is the average of weighted ratings from different web sites. In some embodiments, the weight of a single review on a web site with K reviews for the same product is 1/K. In some embodiments, the weight given to reviews from a particular web site or reviewer can be personalized by a user. For example, a user can specify via a preferences dialog box one or more web sites (e.g., www.consumerreports.org) and/or reviewers (e.g., Roger Ebert) preferred by the user. The ratings given in the reviews associated with the user's preferred sources are then given higher weight when computing an average rating for a product. In one embodiment, the ratings given in reviews associated with the user's preferred sources are given N times the default weight assigned to those reviews, where N is a value such as 2, 2.5, 3 or any other appropriate value greater than 1. An exemplary representation of the average rating computation is:
  • average rating = a i rating i a i
  • where ai represents the weight given to each rating, ratingi, of each respective review i.
  • In some embodiments, frequently appearing phrases in the extracted product reviews associated with a particular product are computed by review aggregation module 236 as follows. For all the phrases (e.g., unigrams, bigrams or trigrams that do not span across sentences) that occur in reviews associated with a product (cluster), the number of their weighted occurrences is counted. Occurrences in the review title or summary are given a higher weight. When computing or selecting frequently appearing phrases, all stop words and words that occur in the title of the product name are ignored. Each phrase is given a score equal to the length of the phrase times the square root of the sum of weighted occurrences of the phrase. A predetermined number of phrases with the highest scores (e.g., the top 6, 7, 8, 9 or 10 phrases) that do not overlap each other are chosen as the frequent phrases and stored in aggregated review buffer 240.
  • Table 3 lists the fields of each aggregated review record, produced by aggregating information from all the extracted reviews for a particular product, and stored in the aggregated reviews buffer 240. In other embodiments, each aggregated review record may have a subset of the fields listed in Table 3, and may also include additional fields. A separate instance of the aggregated review record is included in the aggregated reviews buffer 240 and in the reviews database 242 for each distinct product. In this example, each aggregated review record includes a ClusterID (identifying the product associated with this record), a list of ReviewID's (each of which is uniquely associated with a review of the product), a list of the words or phrases most frequently found in the reviews of the product, and a computed average rating of all the reviews for the product. In some embodiments, an aggregated review record also includes one or more of the following additional fields: a string identifying the product, a number specifying the number of reviews in the cluster; and data concerning the distribution of ratings in the reviews for a product, such as the number of reviews with low, medium and high ratings, as well as the number of reviews having no product rating.
  • TABLE 3
    Exemplary Fields for Aggregated Review Records in
    Aggregated Reviews Buffer and Reviews Database
    required int64 ClusterID // this is the ClusterID for a particular
    product
    repeated int64 ReviewID // this represents a list of ReviewID's,
    // which are the IDs of the individual
    reviews for the product
    repeated string FrequentPhrases // This is a list of phrases that frequently
    // occur in the reviews of the product
    required float AvgRating // This is the average rating for the
    product
    // Additional Optional Fields that may be included in some embodiments:
    optional int32 ReviewNum // number of reviews in cluster
    repeated string ProductName // name(s) of Product to which this
    cluster belongs
    repeated int32 ScoreDistribution // score distribution for: unrated,
    // negative, neutral, and positive reviews
  • Backend server 102 automatically stores the extracted product reviews 244 and the aggregated review information 246 in reviews database 242 (310). In some embodiments, each stored product review in the reviews database 242 comprises a record that includes the same information as the corresponding review record (Table 2) stored in the extracted reviews buffer 234, while in other embodiments, the stored product review may comprise a subset of the information stored in the corresponding review record.
  • As mentioned earlier, reviews indexer 248 indexes the individual product reviews 244 and the aggregated review information for a product 246 and stores the results in reviews index 250.
  • Displaying an Aggregated Review
  • Client 104 sends a request for an aggregated review for a product (402, FIG. 4). In some embodiments, the request is sent in response to a user-initiated activity, such as a user clicking on a hyperlink. For example, referring to FIG. 6A, a user looking at web page 650 with various products for sale could activate hyperlink 652 to see an aggregated review for product 654.
  • Referring to FIG. 4, frontend server 100 receives the request (404) for an aggregated review for a product and obtains (406) the aggregated review. In some embodiments, product information 254 in product database 252 is linked to aggregated reviews for the corresponding products by product-to-review map 256. In some embodiments, map 256 contains mappings of product identifiers used by a first website or service (e.g., a product information website) to the ClusterIDs used by the frontend server 100. The process of assigning product identifiers to products in the first website or service may be analogous to the process described above for identifying particular products that are associated with particular product reviews. In some embodiments, map 256 is determined by matching a global identifier for a particular product (e.g., an ISBN number) in product database 252 with a corresponding review cluster. Product-to-review map 256 is used to identify the aggregated review (in the reviews database 242) corresponding to a product. Presentation module 264 formats the aggregated review. In some embodiments, the reviews index 250 is used to identify the individual reviews corresponding to the aggregated review for a product. More specifically, the reviews index 250 maps the ClusterID of the aggregated review to the individual reviews for the product associated with the aggregated review.
  • Frontend server 100 sends (408) the aggregated review to client computer 104 via communications network 106. Client 104 receives (410) and displays (412) the aggregated review 600 (FIG. 6B).
  • As shown in FIG. 6B, aggregated review 600 includes portions of reviews 626 for the product and a search input area 618 for entering search terms to search for reviews of the product that contain the search terms. Search input area 618 is used to enter search terms for a search that requests only results that are within the reviews for the product. In some embodiments, aggregated review 600 includes a histogram 608 of review ratings for the product, an aggregate rating 604 for the product, and/or the number 606 of reviews for the product. In some embodiments, aggregated review 600 is displayed in a single Web page.
  • Searching
  • In some embodiments, review search module 258 supports two different types of queries: (1) search within the reviews 262 belonging to a single product (e.g. a search for “customer service” in the reviews for a given product) and (2) search all reviews 260 to find a particular product (e.g. a search for “good digital camera”). Aggregated review 600, FIG. 6B, shows a search input area 618 for the first type of search.
  • For the first type of search, a variety of user initiated activities can start the search process, including the user entering search terms in search input area 618, selecting a hyperlink for a frequently appearing phrase 620, selecting a hyperlink to a review source 622, or selecting a hyperlinked portion (e.g., 610, 612, 614, or 616) of histogram 608.
  • In response to this user-initiated activity, client 104 sends (502, FIG. 5) a request to search within a plurality of reviews for a particular product.
  • Referring to FIG. 5, frontend server 100 receives (504) the search request from client 104 and obtains (506) information satisfying the request. In some embodiments, the information is obtained by review search module 258 sending two queries to reviews database 242. In the first query, all individual reviews (sometimes called “documents”) that contain the search terms are found. When searching within reviews for a single product, the first query is restricted to the main body (e.g., the “review content” fields) of the reviews index 250, and only to individual reviews that belong to the product (cluster). For example, the query “customer service” for reviews in cluster 1234 would be translated to the following:

  • query restricts: ClusterID=1234, body:(customer service).
  • A second query is then issued to obtain portions of all reviews that are to be displayed. The ordering preferences submitted by the user (e.g., by clicking on 636, 638, 640, and/or 642) as well as the start position are used to determine which reviews to display. When the number of reviews for a product exceeds the maximum number of review portions to be displayed on a single web page, the start position indicates the ordinal number within an ordered set of reviews at which to begin formatting reviews for display.
  • Frontend server 100 sends (508, FIG. 5) at least a portion of the obtained information to client 104.
  • Client 104 receives (510) information satisfying the request and displays (512) the received information, e.g. in web page 660, FIG. 6C. In some embodiments, Client 104 displays at least portions 662 of reviews of the product matching the search terms 664.
  • The information sent by frontend server 100 and received by client 104 includes at least portions 662 of reviews for the particular product that satisfy the search request. In some embodiments, this information includes the identities 666 of the review sources contributing to the sent information.
  • For the second type of search, the information is also obtained by review search module 258 sending two queries to reviews database 242. In the first query, all individual reviews that contain the search terms are found, without restricting the search to a particular product (cluster). In some embodiments, the results of the first query are grouped by cluster (product). A second query is then issued to obtain portions of at most three (or some other small number) reviews in each cluster, which are restricted to be from different publishers. In some embodiments, information about each of the clusters, such as the total number of reviews and the average rating, is also requested, obtained, and sent for display.
  • Graphical User Interface
  • The graphical user interface 600 (FIG. 6B) on client computer 104 comprises a plurality of portions 626 of reviews for a product and a search input area 618 for entering search terms to search for reviews of the product that contain the search terms. In some embodiments, the plurality of portions of reviews can be grouped by review source 636, sorted by date 642, and sorted by relevance to the search input 640, or sorted by quality of review when the user has not provided a search input. In some embodiments, more or fewer sorting and grouping options may be provided for sorting and/or grouping the reviews.
  • In some embodiments, graphical user interface 600 also includes, either alone or in various combinations: an aggregate rating 604 for the product; a list of suggested search terms 620; a list of sources 622 for the reviews that link back to the source Web sites or to the corresponding reviews; at least one user selectable link for sorting the portions of reviews by date 642 or relevance to the search input 640, and/or for grouping the portions of reviews by source 636; and/or a histogram 608 of reviews for the product. In some embodiments, clicking on a region (e.g., 610, 612, 614, or 616) in the histogram automatically generates portions of at least some of the reviews corresponding to the histogram region. For example, clicking on region 616 may generate a new display showing portions of reviews having high ratings for the product.
  • An aggregated rating 604 is given to the product by calculating an average over all reviews for the product. The ratings in the individual reviews 244 and in the aggregated reviews 246 are normalized, e.g., to a 0-1 or 0-5 scale. In some embodiments, after a search within reviews for a product 262, the aggregated rating 604 and number 606 of reviews are changed to reflect the average rating and number of reviews matching the search. This change in the aggregated rating can help users answer questions like, “How do reviewers that discuss ‘customer service’ rate this product?” In some other embodiments, the aggregated rating 604 and number 606 of reviews are not changed to reflect the average rating and number of reviews matching the search.
  • Commonly used phrases (e.g., unigrams, bigrams or trigrams) appearing in the reviews for the product are displayed as a list of suggested search terms 620. Phrases that contain the product name and stop-words (such as “the”) are removed to improve the quality of the suggested search terms. Clicking on one of the phrases results in displaying just the reviews that contain the user selected phrase. This is equivalent to entering the phrase into the search box 618 with quotes around the phrase and then initiating a search. Showing these phrases gives the user a good idea which aspects of the product are most frequently discussed in the product reviews, and thus which aspects of the product might be important to know about.
  • The list of review sources 622 includes links to the sites from which reviews for the product have been obtained, or a subset thereof. In some embodiments, after a search within reviews for a product, the list of review sources 622 just includes links to the sites from which reviews matching the search have been obtained, or a subset thereof. In some embodiments, the links lead to the main pages for the given product on each source web site. In some embodiments, the links lead to the product reviews matching the search.
  • A histogram 608 with a rating distribution of the reviews includes the number of reviews that are unrated 610 (in some embodiments, displayed as a gray area), negative 612 (in some embodiments, displayed as a red area), neutral 614 (in some embodiments, displayed as a yellow area) and positive 616 (in some embodiments, displayed as a green area). In some embodiments, the rating distribution is displayed as a flat histogram, where the length of each colored bar is proportional to the number of reviews in the category. Clicking on a histogram results in a search 262 that just shows the corresponding reviews, e.g. clicking on the red bar 612 results in showing just the negative reviews.
  • Search input area 618 allows a user to enter arbitrary search terms for the search within reviews for a given product. As shown in FIG. 6C, in some embodiments the search terms are highlighted in portions of reviews 626 that match the search query.
  • User selectable links permit the displayed list of reviews to be sorted in various ways, such as by:
      • Relevance 640: In some embodiments, editorial reviews are listed before user reviews. In some embodiments, the reviews are sorted by their length and information content, e.g., sorted by Review-per-word-entropy*sqrt(Review-Length). Other scoring functions, such as scoring reviews by an “information retrieval” score, can also be used.
      • Date 642: The reviews are sorted by date, with the newest reviews coming first. Reviews for which a date is unknown may be listed at the end.
      • Review Source 636: Reviews are grouped by review site, as represented by screen shot 670 in FIG. 6D, with at most N reviews (e.g., 3, 4 or 5, or some other small number) shown for each review site. At the top of each group, a link 672 to the corresponding review site is provided, which may be the same as the link provided to the site in the list of review sources 622.
      • Ungrouped 638 (e.g., not grouped by review source). This view is similar to traditional search result displays. Reviews are listed (i.e., portions of the reviews are displayed) one-by-one in a selected order (e.g., by date or relevance) and reviews from different sources are mixed (i.e., without grouping the portions of reviews by source).
      • Location (not shown in aggregated review 600, FIG. 6B). This view permits reviews to be sorted by location (e.g., for reviews of restaurants, local businesses, movies or other products near a particular location, such as by address, city, region, or mailing or zip code).
  • In some embodiments, for reviews listed in an aggregated review (e.g., graphical user interface 600 or 670) or reviews listed as results for a search within reviews for a product (e.g., graphical user interface 660, FIG. 6C) the following information is displayed: review title 624, a portion 626 of the review (e.g., a snippet), the source of the review 628, the date of the review 634, the type of review 630 (e.g., whether it is an editorial or user contributed review), and the name of the reviewer 632. The portion or snippet 626 of each review may be generated using the same or similar techniques to those used to generate snippets of search results in an Internet search engine such as the Google search engine. For example, when a graphical user interface is presenting reviews for a product, the portion of each review shown may be a portion matching one or more search terms that were used to select the reviews to be displayed. Alternately, the portion of each review shown may be a beginning portion of the review. In yet another embodiment, the portion of each review shown may be a portion determined by an automatic process to be a summary or most important portion of the review.
  • The foregoing description, for purpose of explanation, has been described with reference to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the present disclosure to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the present disclosure and its practical applications, to thereby enable others skilled in the art to best utilize the present disclosure and various disclosed embodiments with various modifications as are suited to the particular use contemplated.

Claims (20)

What is claimed is:
1. A method comprising:
at a computer system:
extracting a plurality of user reviews from one or more electronic documents, wherein the electronic documents contain user reviews of a plurality of items of interest;
identifying a set of user reviews in the plurality of user reviews as being associated with the same item of interest in the plurality of items of interest, using item identifying information included in the one or more electronic documents; and
associating the set of user reviews with the same item of interest.
2. The method of claim 1, wherein the item identifying information includes one of: a unique product identifier, brand name, model number, or category information.
3. The method of claim 1, wherein the item identifying information is extracted from metadata included in the one or more electronic documents.
4. The method of claim 1, wherein the item identifying information is extracted from a URL associated with an electronic document in the one or more electronic documents.
5. The method of claim 1, wherein the one or more electronic documents are obtained from e-commerce websites or product-review websites.
6. The method of claim 1, further comprising:
in response to a user search,
formatting for display a snippet of a user review in the set of user reviews.
7. The method of claim 1, further comprising:
in response to a user search,
formatting for display a representation of a count of user reviews included in the set of user reviews.
8. The method of claim 1, further comprising:
in response to a user search,
formatting for display one or more user reviews, in the plurality of user reviews, associated with the same review source.
9. A system comprising:
one or more processors;
memory; and
one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for:
extracting a plurality of user reviews from one or more electronic documents, wherein the electronic documents contain user reviews of a plurality of items of interest;
identifying a set of user reviews in the plurality of user reviews as being associated with the same item of interest in the plurality of items of interest, using item identifying information included in the one or more electronic documents; and
associating the set of user reviews with the same item of interest.
10. The system of claim 9, wherein the item identifying information includes one of: a unique product identifier, brand name, model number, or category information.
11. The system of claim 9, wherein the item identifying information is extracted from metadata included in the one or more electronic documents.
12. The system of claim 9, wherein the item identifying information is extracted from a URL associated with an electronic document in the one or more electronic documents.
13. The system of claim 9, wherein the one or more electronic documents are obtained from e-commerce websites or product-review websites.
14. The system of claim 9, wherein the one or more programs further comprise instructions for:
in response to a user search,
formatting for display a snippet of a user review in the set of user reviews.
15. The system of claim 9, wherein the one or more programs further comprise instructions for:
in response to a user search,
formatting for display a representation of a count of user reviews in the set of user reviews.
16. The system of claim 9, wherein the one or more programs further comprise instructions for:
in response to a user search,
formatting for display one or more user reviews, in the plurality of user reviews, associated with the same review source.
17. A non-transitory computer readable storage medium having stored thereon one or more programs, wherein the one or more programs including instructions for:
extracting a plurality of user reviews from one or more electronic documents, wherein the electronic documents contain user reviews of a plurality of items of interest;
identifying a set of user reviews in the plurality of user reviews as being associated with the same item of interest in the plurality of items of interest, using item identifying information included in the one or more electronic documents; and
associating the set of user reviews with the same item of interest.
18. The non-transitory computer readable storage medium of claim 17, wherein the item identifying information includes one of: a unique product identifier, brand name, model number, or category information.
19. The non-transitory computer readable storage medium of claim 17, wherein the item identifying information is extracted from metadata included in the one or more electronic documents.
20. The non-transitory computer readable storage medium of claim 17, wherein the one or more electronic documents are obtained from e-commerce websites or product-review websites.
US13/754,646 2004-12-14 2013-01-30 Systems and Methods for Clustering User Reviews Abandoned US20130144862A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/754,646 US20130144862A1 (en) 2004-12-14 2013-01-30 Systems and Methods for Clustering User Reviews

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US11/012,500 US7962461B2 (en) 2004-12-14 2004-12-14 Method and system for finding and aggregating reviews for a product
US13/102,023 US8161030B2 (en) 2004-12-14 2011-05-05 Method and system for aggregating reviews and searching within reviews for a product
US13/407,650 US8380694B2 (en) 2004-12-14 2012-02-28 Method and system for aggregating reviews and searching within reviews for a product
US13/754,646 US20130144862A1 (en) 2004-12-14 2013-01-30 Systems and Methods for Clustering User Reviews

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US13/407,650 Continuation US8380694B2 (en) 2004-12-14 2012-02-28 Method and system for aggregating reviews and searching within reviews for a product

Publications (1)

Publication Number Publication Date
US20130144862A1 true US20130144862A1 (en) 2013-06-06

Family

ID=36585221

Family Applications (4)

Application Number Title Priority Date Filing Date
US11/012,500 Active 2028-11-12 US7962461B2 (en) 2004-12-14 2004-12-14 Method and system for finding and aggregating reviews for a product
US13/102,023 Expired - Fee Related US8161030B2 (en) 2004-12-14 2011-05-05 Method and system for aggregating reviews and searching within reviews for a product
US13/407,650 Expired - Fee Related US8380694B2 (en) 2004-12-14 2012-02-28 Method and system for aggregating reviews and searching within reviews for a product
US13/754,646 Abandoned US20130144862A1 (en) 2004-12-14 2013-01-30 Systems and Methods for Clustering User Reviews

Family Applications Before (3)

Application Number Title Priority Date Filing Date
US11/012,500 Active 2028-11-12 US7962461B2 (en) 2004-12-14 2004-12-14 Method and system for finding and aggregating reviews for a product
US13/102,023 Expired - Fee Related US8161030B2 (en) 2004-12-14 2011-05-05 Method and system for aggregating reviews and searching within reviews for a product
US13/407,650 Expired - Fee Related US8380694B2 (en) 2004-12-14 2012-02-28 Method and system for aggregating reviews and searching within reviews for a product

Country Status (2)

Country Link
US (4) US7962461B2 (en)
CN (1) CN101151607B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060277290A1 (en) * 2005-06-02 2006-12-07 Sam Shank Compiling and filtering user ratings of products
US20130254181A1 (en) * 2012-01-03 2013-09-26 Be Labs, Llc Aggregation and Categorization
US20150178713A1 (en) * 2013-12-23 2015-06-25 Mastercard International Incorporated Method and system of providing financial transaction card related mobile apps
US20150264093A1 (en) * 2014-03-14 2015-09-17 ResearchGate Corporation Publication review user interface and system
US20160283497A1 (en) * 2015-03-27 2016-09-29 Ca, Inc. Analysis of mobile application reviews based on content, reviewer credibility, and temporal and geographic clustering
US10325274B2 (en) * 2014-01-31 2019-06-18 Walmart Apollo, Llc Trend data counter
US11010828B2 (en) 2011-06-22 2021-05-18 Rakuten, Inc. Information processing apparatus, information processing method, information processing program, recording medium having stored therein information processing program

Families Citing this family (268)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090024409A1 (en) * 2002-02-06 2009-01-22 Ryan Steelberg Apparatus, system and method for a brand affinity engine using positive and negative mentions
US20090112692A1 (en) * 2007-10-31 2009-04-30 Ryan Steelberg Engine, system and method for generation of brand affinity content
US20090018922A1 (en) * 2002-02-06 2009-01-15 Ryan Steelberg System and method for preemptive brand affinity content distribution
US20090112715A1 (en) * 2007-10-31 2009-04-30 Ryan Steelberg Engine, system and method for generation of brand affinity content
US20090112698A1 (en) * 2007-10-31 2009-04-30 Ryan Steelberg System and method for brand affinity content distribution and optimization
US20090228354A1 (en) * 2008-03-05 2009-09-10 Ryan Steelberg Engine, system and method for generation of brand affinity content
US7912781B2 (en) 2004-06-08 2011-03-22 Rosenthal Collins Group, Llc Method and system for providing electronic information for risk assessment and management for multi-market electronic trading
WO2005124632A2 (en) * 2004-06-08 2005-12-29 Rosenthal Collins, Group, Llc Method and system for providing electronic information for multi-market electronic trading
US8429059B2 (en) 2004-06-08 2013-04-23 Rosenthal Collins Group, Llc Method and system for providing electronic option trading bandwidth reduction and electronic option risk management and assessment for multi-market electronic trading
US20060010066A1 (en) * 2004-07-12 2006-01-12 Rosenthal Collins Group, L.L.C. Method and system for providing a graphical user interface for electronic trading
US7620586B2 (en) * 2004-09-08 2009-11-17 Rosenthal Collins Group, Llc Method and system for providing automatic execution of trading strategies for electronic trading
US7624064B2 (en) 2004-11-01 2009-11-24 Rosenthal Collins Group, Llc Method and system for providing multiple graphic user interfaces for electronic trading
US20060112130A1 (en) * 2004-11-24 2006-05-25 Linda Lowson System and method for resource management
WO2006063016A2 (en) 2004-12-09 2006-06-15 Rosenthal Collins Group, Llc Method and system for providing configurable features for graphical user interfaces for electronic trading
US7624102B2 (en) * 2005-01-28 2009-11-24 Microsoft Corporation System and method for grouping by attribute
US7788087B2 (en) * 2005-03-01 2010-08-31 Microsoft Corporation System for processing sentiment-bearing text
US7788086B2 (en) * 2005-03-01 2010-08-31 Microsoft Corporation Method and apparatus for processing sentiment-bearing text
US7801801B2 (en) 2005-05-04 2010-09-21 Rosenthal Collins Group, Llc Method and system for providing automatic execution of black box strategies for electonic trading
US8364575B2 (en) 2005-05-04 2013-01-29 Rosenthal Collins Group, Llc Method and system for providing automatic execution of black box strategies for electronic trading
US8589280B2 (en) 2005-05-04 2013-11-19 Rosenthal Collins Group, Llc Method and system for providing automatic execution of gray box strategies for electronic trading
US7617149B2 (en) 2005-05-31 2009-11-10 Rosenthal Collins Group, Llc Method and system for electronically inputting, monitoring and trading spreads
US7756475B2 (en) * 2005-06-03 2010-07-13 Broadcom Corporation WiFi interface to dialup modem
WO2007041391A2 (en) * 2005-09-30 2007-04-12 Rosenthal Collins Group, Llc Method and system for providing accounting for electronic trading
US8010480B2 (en) * 2005-09-30 2011-08-30 Google Inc. Selecting high quality text within identified reviews for display in review snippets
US20070078670A1 (en) * 2005-09-30 2007-04-05 Dave Kushal B Selecting high quality reviews for display
US7827052B2 (en) 2005-09-30 2010-11-02 Google Inc. Systems and methods for reputation management
US8438469B1 (en) 2005-09-30 2013-05-07 Google Inc. Embedded review and rating information
US7849000B2 (en) 2005-11-13 2010-12-07 Rosenthal Collins Group, Llc Method and system for electronic trading via a yield curve
US7734533B2 (en) * 2005-11-13 2010-06-08 Rosenthal Collins Group, Llc Method and system for electronic trading via a yield curve
US20070143122A1 (en) * 2005-12-06 2007-06-21 Holloway Lane T Business method for correlating product reviews published on the world wide Web to provide an overall value assessment of the product being reviewed
US20070192161A1 (en) * 2005-12-28 2007-08-16 International Business Machines Corporation On-demand customer satisfaction measurement
US9141713B1 (en) * 2005-12-30 2015-09-22 Amazon Technologies, Inc. System and method for associating keywords with a web page
CA2637558A1 (en) * 2006-02-09 2007-08-23 Unz.Org, Llc Organizing digitized content on the internet through digitized content reviews
US7860803B1 (en) * 2006-02-15 2010-12-28 Google Inc. Method and system for obtaining feedback for a product
US8489438B1 (en) * 2006-03-31 2013-07-16 Intuit Inc. Method and system for providing a voice review
US8112403B2 (en) * 2006-05-19 2012-02-07 Symantec Corporation Providing a rating for a web site based on weighted user feedback
WO2007146100A2 (en) * 2006-06-07 2007-12-21 Cnet Networks, Inc. Evaluative information system and method
US7831928B1 (en) 2006-06-22 2010-11-09 Digg, Inc. Content visualization
US8615440B2 (en) * 2006-07-12 2013-12-24 Ebay Inc. Self correcting online reputation
US20080059486A1 (en) * 2006-08-24 2008-03-06 Derek Edwin Pappas Intelligent data search engine
US8166029B2 (en) * 2006-09-07 2012-04-24 Yahoo! Inc. System and method for identifying media content items and related media content items
US7895127B2 (en) * 2006-09-29 2011-02-22 Weiser Anatoly S Rating-based sorting and displaying of reviews
US8108463B2 (en) * 2007-02-20 2012-01-31 Richrelevance, Inc. Demand-driven, collaborative systems and processes for collecting structured information
US20080249764A1 (en) * 2007-03-01 2008-10-09 Microsoft Corporation Smart Sentiment Classifier for Product Reviews
US7979459B2 (en) * 2007-06-15 2011-07-12 Microsoft Corporation Scalable model-based product matching
JP4359787B2 (en) * 2007-07-02 2009-11-04 ソニー株式会社 Information processing apparatus, content reputation search method, and content reputation search system
US20120246085A1 (en) * 2007-07-19 2012-09-27 Depalma Mark S Systems and methods for entity specific, data capture and exchange over a network
US20090063247A1 (en) * 2007-08-28 2009-03-05 Yahoo! Inc. Method and system for collecting and classifying opinions on products
US20090112714A1 (en) * 2007-10-31 2009-04-30 Ryan Steelberg Engine, system and method for generation of brand affinity content
US20100030746A1 (en) * 2008-07-30 2010-02-04 Ryan Steelberg System and method for distributing content for use with entertainment creatives including consumer messaging
US20100318375A1 (en) * 2007-09-07 2010-12-16 Ryan Steelberg System and Method for Localized Valuations of Media Assets
US20100131085A1 (en) * 2007-09-07 2010-05-27 Ryan Steelberg System and method for on-demand delivery of audio content for use with entertainment creatives
US8452764B2 (en) * 2007-09-07 2013-05-28 Ryan Steelberg Apparatus, system and method for a brand affinity engine using positive and negative mentions and indexing
US20100106601A1 (en) * 2007-09-07 2010-04-29 Ryan Steelberg System and method for distributing text content for use in one or more creatives
US20100131337A1 (en) * 2007-09-07 2010-05-27 Ryan Steelberg System and method for localized valuations of media assets
US20100114703A1 (en) * 2007-09-07 2010-05-06 Ryan Steelberg System and method for triggering development and delivery of advertisements
US20100082598A1 (en) * 2008-02-07 2010-04-01 Brand Affinity Technologies, Inc. Engine, system and method for generation of brand affinity content
US9633505B2 (en) 2007-09-07 2017-04-25 Veritone, Inc. System and method for on-demand delivery of audio content for use with entertainment creatives
US20090112718A1 (en) * 2007-10-31 2009-04-30 Ryan Steelberg System and method for distributing content for use with entertainment creatives
US20100217664A1 (en) * 2007-09-07 2010-08-26 Ryan Steelberg Engine, system and method for enhancing the value of advertisements
US20110047050A1 (en) * 2007-09-07 2011-02-24 Ryan Steelberg Apparatus, System And Method For A Brand Affinity Engine Using Positive And Negative Mentions And Indexing
US8285700B2 (en) 2007-09-07 2012-10-09 Brand Affinity Technologies, Inc. Apparatus, system and method for a brand affinity engine using positive and negative mentions and indexing
US20090112717A1 (en) * 2007-10-31 2009-04-30 Ryan Steelberg Apparatus, system and method for a brand affinity engine with delivery tracking and statistics
US9294727B2 (en) 2007-10-31 2016-03-22 Veritone, Inc. System and method for creation and management of advertising inventory using metadata
US20100114701A1 (en) * 2007-09-07 2010-05-06 Brand Affinity Technologies, Inc. System and method for brand affinity content distribution and optimization with charitable organizations
US8725563B2 (en) * 2007-09-07 2014-05-13 Brand Affinity Technologies, Inc. System and method for searching media assets
US20110040648A1 (en) * 2007-09-07 2011-02-17 Ryan Steelberg System and Method for Incorporating Memorabilia in a Brand Affinity Content Distribution
US7809603B2 (en) * 2007-09-07 2010-10-05 Brand Affinity Technologies, Inc. Advertising request and rules-based content provision engine, system and method
US20100131357A1 (en) * 2007-09-07 2010-05-27 Ryan Steelberg System and method for controlling user and content interactions
US20100114719A1 (en) * 2007-09-07 2010-05-06 Ryan Steelberg Engine, system and method for generation of advertisements with endorsements and associated editorial content
US20100274644A1 (en) * 2007-09-07 2010-10-28 Ryan Steelberg Engine, system and method for generation of brand affinity content
US20110078003A1 (en) * 2007-09-07 2011-03-31 Ryan Steelberg System and Method for Localized Valuations of Media Assets
US8751479B2 (en) * 2007-09-07 2014-06-10 Brand Affinity Technologies, Inc. Search and storage engine having variable indexing for information associations
US20090112700A1 (en) * 2007-10-31 2009-04-30 Ryan Steelberg System and method for brand affinity content distribution and optimization
US20090083096A1 (en) * 2007-09-20 2009-03-26 Microsoft Corporation Handling product reviews
US7539632B1 (en) * 2007-09-26 2009-05-26 Amazon Technologies, Inc. Method, medium, and system for providing activity interest information
US20100076866A1 (en) * 2007-10-31 2010-03-25 Ryan Steelberg Video-related meta data engine system and method
US20110106632A1 (en) * 2007-10-31 2011-05-05 Ryan Steelberg System and method for alternative brand affinity content transaction payments
US20090299837A1 (en) * 2007-10-31 2009-12-03 Ryan Steelberg System and method for brand affinity content distribution and optimization
US20090144226A1 (en) * 2007-12-03 2009-06-04 Kei Tateno Information processing device and method, and program
US20090144264A1 (en) * 2007-12-04 2009-06-04 Vik Singh Third-party information overlay on search results
US20090164266A1 (en) * 2007-12-21 2009-06-25 Microsoft Corporation Category aggregated opinion data
WO2009087636A1 (en) * 2008-01-10 2009-07-16 Yissum Research Development Company Of The Hebrew University Of Jerusalem Method and system for automatically ranking product reviews according to review helpfulness
EP2257919A4 (en) * 2008-02-07 2012-12-12 Brand Affinity Tech Inc Qualitative and quantitative method for rating a brand using keywords
US8965865B2 (en) * 2008-02-15 2015-02-24 The University Of Utah Research Foundation Method and system for adaptive discovery of content on a network
US7996390B2 (en) * 2008-02-15 2011-08-09 The University Of Utah Research Foundation Method and system for clustering identified forms
WO2009105277A1 (en) * 2008-02-22 2009-08-27 Socialrep, Llc System and method for measuring and managing distributed online conversations
US20090254862A1 (en) * 2008-03-03 2009-10-08 Kidzui, Inc Method and apparatus for user interface for child oriented computer network
US20090299796A1 (en) * 2008-05-30 2009-12-03 Microsoft Corporation Configuration of resources for an entity
US20090307053A1 (en) * 2008-06-06 2009-12-10 Ryan Steelberg Apparatus, system and method for a brand affinity engine using positive and negative mentions
WO2009152154A1 (en) * 2008-06-09 2009-12-17 J.D. Power And Associates Automatic sentiment analysis of surveys
US8190594B2 (en) * 2008-06-09 2012-05-29 Brightedge Technologies, Inc. Collecting and scoring online references
CA2727711A1 (en) * 2008-06-12 2009-12-17 Ryan Steelberg Barcode advertising
WO2009155375A2 (en) * 2008-06-19 2009-12-23 Wize, Inc. System and method for aggregating and summarizing product/topic sentiment
US8099408B2 (en) 2008-06-27 2012-01-17 Microsoft Corporation Web forum crawling using skeletal links
US8321300B1 (en) * 2008-06-30 2012-11-27 Bazaarvoice, Inc. Method and system for distribution of user generated content
US20110179009A1 (en) * 2008-09-23 2011-07-21 Sang Hyob Nam Internet-based opinion search system and method, and internet-based opinion search and advertising service system and method
EP2172884A1 (en) * 2008-09-25 2010-04-07 Motorola, Inc. Content item review management
WO2010036643A1 (en) * 2008-09-26 2010-04-01 Brand Affinity Technologies, Inc. An advertising request and rules-based content provision engine, system and method
AU2009296763A1 (en) * 2008-09-26 2010-04-01 Brand Affinity Technologies, Inc. An advertising request and rules-based content provision engine, system and method
WO2010039860A1 (en) * 2008-09-30 2010-04-08 Brand Affinity Technologies, Inc. System and method for brand affinity content distribution and placement
JP2012504829A (en) * 2008-10-01 2012-02-23 ライアン・スティールバーグ Local barcode advertisement
AU2009303824A1 (en) 2008-10-14 2010-04-22 Brand Affinity Technologies, Inc. Apparatus, system and method for a brand affinity engine using positive and negative mentions and indexing
CA2742788A1 (en) * 2008-11-06 2010-05-14 Brand Affinity Technologies, Inc. System and method for searching media assets
WO2010054201A1 (en) 2008-11-06 2010-05-14 Bazaarvoice Method and system for promoting user generation of content
US20100121702A1 (en) * 2008-11-06 2010-05-13 Ryan Steelberg Search and storage engine having variable indexing for information associations and predictive modeling
WO2010054234A1 (en) * 2008-11-06 2010-05-14 Brand Affinity Technologies, Inc. System and method for developing software and web based applications
WO2010056866A1 (en) * 2008-11-14 2010-05-20 Brand Affinity Technologies, Inc. System and method for brand affinity content distribution and optimization
US20100185616A1 (en) * 2009-01-14 2010-07-22 Cbs Interactive, Inc. Systems and methods for predictive recommendations
US20100185651A1 (en) * 2009-01-16 2010-07-22 Google Inc. Retrieving and displaying information from an unstructured electronic document collection
US8412749B2 (en) 2009-01-16 2013-04-02 Google Inc. Populating a structured presentation with new values
US8977645B2 (en) 2009-01-16 2015-03-10 Google Inc. Accessing a search interface in a structured presentation
US8452791B2 (en) * 2009-01-16 2013-05-28 Google Inc. Adding new instances to a structured presentation
US8615707B2 (en) * 2009-01-16 2013-12-24 Google Inc. Adding new attributes to a structured presentation
US8170958B1 (en) * 2009-01-29 2012-05-01 Intuit Inc. Internet reputation manager
WO2010090724A1 (en) * 2009-02-05 2010-08-12 Bazaarvoice Method and system for providing content generation capabilities
US20100235343A1 (en) * 2009-03-13 2010-09-16 Microsoft Corporation Predicting Interestingness of Questions in Community Question Answering
US20100235311A1 (en) * 2009-03-13 2010-09-16 Microsoft Corporation Question and answer search
US8713017B2 (en) 2009-04-23 2014-04-29 Ebay Inc. Summarization of short comments
FR2945651A1 (en) * 2009-05-15 2010-11-19 France Telecom DEVICE AND METHOD FOR UPDATING A USER PROFILE
US20100306223A1 (en) * 2009-06-01 2010-12-02 Google Inc. Rankings in Search Results with User Corrections
US20110106819A1 (en) * 2009-10-29 2011-05-05 Google Inc. Identifying a group of related instances
US20110004508A1 (en) * 2009-07-02 2011-01-06 Shen Huang Method and system of generating guidance information
US8645295B1 (en) 2009-07-27 2014-02-04 Amazon Technologies, Inc. Methods and system of associating reviewable attributes with items
US20110029926A1 (en) * 2009-07-30 2011-02-03 Hao Ming C Generating a visualization of reviews according to distance associations between attributes and opinion words in the reviews
US9686341B1 (en) * 2009-08-20 2017-06-20 A9.Com, Inc. Review trends
US8180755B2 (en) * 2009-09-04 2012-05-15 Yahoo! Inc. Matching reviews to objects using a language model
US8972436B2 (en) * 2009-10-28 2015-03-03 Yahoo! Inc. Translation model and method for matching reviews to objects
US9576251B2 (en) * 2009-11-13 2017-02-21 Hewlett Packard Enterprise Development Lp Method and system for processing web activity data
US8671029B2 (en) * 2010-01-11 2014-03-11 Ebay Inc. Method, medium, and system for managing recommendations in an online marketplace
US20110225203A1 (en) * 2010-03-11 2011-09-15 Board Of Trustees Of Michigan State University Systems and methods for tracking and evaluating review tasks
US8447823B2 (en) 2010-07-23 2013-05-21 Ebay Inc. Instant messaging robot to provide product information
US8346623B2 (en) * 2010-08-06 2013-01-01 Cbs Interactive Inc. System and method for navigating a collection of editorial content
CN102402519A (en) * 2010-09-09 2012-04-04 腾讯科技(深圳)有限公司 Search method, search result display method, search server and client
US9311619B2 (en) 2010-09-10 2016-04-12 Visible Technologies Llc Systems and methods for consumer-generated media reputation management
US8438149B1 (en) * 2010-09-21 2013-05-07 Amazon Technologies, Inc. Generating network pages for search engines
US9405825B1 (en) * 2010-09-29 2016-08-02 Amazon Technologies, Inc. Automatic review excerpt extraction
US8412771B2 (en) 2010-10-21 2013-04-02 Yahoo! Inc. Matching items of user-generated content to entities
US20120101721A1 (en) * 2010-10-21 2012-04-26 Telenav, Inc. Navigation system with xpath repetition based field alignment mechanism and method of operation thereof
US8682740B2 (en) 2010-10-26 2014-03-25 Cbs Interactive Inc. Systems and methods using a manufacturer line, series, model hierarchy
WO2012061748A1 (en) * 2010-11-05 2012-05-10 Bazaarvoice Method and system for distribution of content
CN102467709B (en) * 2010-11-17 2017-03-01 阿里巴巴集团控股有限公司 A kind of method and apparatus sending merchandise news
US8825813B2 (en) 2010-12-28 2014-09-02 Microsoft Corporation Distributed network coordinate system based on network performance
US20120209751A1 (en) * 2011-02-11 2012-08-16 Fuji Xerox Co., Ltd. Systems and methods of generating use-based product searching
US9672555B1 (en) 2011-03-18 2017-06-06 Amazon Technologies, Inc. Extracting quotes from customer reviews
US8554701B1 (en) 2011-03-18 2013-10-08 Amazon Technologies, Inc. Determining sentiment of sentences from customer reviews
US20120246092A1 (en) * 2011-03-24 2012-09-27 Aaron Stibel Credibility Scoring and Reporting
WO2012129775A1 (en) * 2011-03-29 2012-10-04 Google Inc. Aggregating product review information for electronic product catalogs
US9965470B1 (en) 2011-04-29 2018-05-08 Amazon Technologies, Inc. Extracting quotes from customer reviews of collections of items
US20120278318A1 (en) * 2011-05-01 2012-11-01 Reznik Alan M Systems and methods for facilitating enhancements to electronic group searches
US11841912B2 (en) 2011-05-01 2023-12-12 Twittle Search Limited Liability Company System for applying natural language processing and inputs of a group of users to infer commonly desired search results
US8700480B1 (en) * 2011-06-20 2014-04-15 Amazon Technologies, Inc. Extracting quotes from customer reviews regarding collections of items
US8838618B1 (en) * 2011-07-01 2014-09-16 Amazon Technologies, Inc. System and method for identifying feature phrases in item description information
US8606831B2 (en) * 2011-07-08 2013-12-10 Georgia Tech Research Corporation Systems and methods for providing reputation management
US8706732B1 (en) * 2011-07-12 2014-04-22 Google Inc. Managing information about entities using clusters of received observations
US8676804B1 (en) 2011-07-12 2014-03-18 Google Inc. Managing information about entities using observations generated from user modified values
US20130144705A1 (en) * 2011-07-31 2013-06-06 Coentre Ventures Llc Socially Networked Business and Consumer Space
US8671098B2 (en) 2011-09-14 2014-03-11 Microsoft Corporation Automatic generation of digital composite product reviews
US10402299B2 (en) 2011-11-02 2019-09-03 Microsoft Technology Licensing, Llc Configuring usage events that affect analytics of usage information
US9218417B2 (en) * 2011-11-02 2015-12-22 Microsoft Technology Licensing, Llc Ad-hoc queries integrating usage analytics with search results
US20140304189A1 (en) * 2011-11-16 2014-10-09 G2Link Llc Software and Method for Rating a Business
US9135350B2 (en) 2012-01-05 2015-09-15 Sri International Computer-generated sentiment-based knowledge base
US9400847B1 (en) * 2012-01-31 2016-07-26 Bazaarvoice, Inc. Method and system for pagination of content
US8996425B1 (en) 2012-02-09 2015-03-31 Audible, Inc. Dynamically guided user reviews
US9396490B1 (en) 2012-02-28 2016-07-19 Bazaarvoice, Inc. Brand response
CN103297462B (en) * 2012-02-28 2017-12-29 阿里巴巴集团控股有限公司 The verification method and device of a kind of business object
US10636041B1 (en) 2012-03-05 2020-04-28 Reputation.Com, Inc. Enterprise reputation evaluation
US8494973B1 (en) 2012-03-05 2013-07-23 Reputation.Com, Inc. Targeting review placement
US20130297383A1 (en) * 2012-05-03 2013-11-07 International Business Machines Corporation Text analytics generated sentiment tree
JP5910316B2 (en) * 2012-05-28 2016-04-27 ソニー株式会社 Information processing apparatus, information processing method, and program
US8515828B1 (en) * 2012-05-29 2013-08-20 Google Inc. Providing product recommendations through keyword extraction from negative reviews
US9177066B2 (en) * 2012-06-14 2015-11-03 Yahoo! Inc. Method and system for displaying comments associated with a query
US8918312B1 (en) 2012-06-29 2014-12-23 Reputation.Com, Inc. Assigning sentiment to themes
US9658824B1 (en) * 2012-07-02 2017-05-23 Amazon Technologies, Inc. Extracting topics from customer review search queries
US9607325B1 (en) * 2012-07-16 2017-03-28 Amazon Technologies, Inc. Behavior-based item review system
US9235858B1 (en) 2012-07-25 2016-01-12 Amazon Technologies, Inc. Local search of network content
CN103577452A (en) 2012-07-31 2014-02-12 国际商业机器公司 Website server and method and device for enriching content of website
US9619828B2 (en) * 2012-09-14 2017-04-11 Google Inc. Product review system and method
US20140136517A1 (en) * 2012-11-10 2014-05-15 Chian Chiu Li Apparatus And Methods for Providing Search Results
CN102984251B (en) * 2012-11-26 2016-08-03 北京奇虎科技有限公司 Realize the system and method for web page content review
CN102968487B (en) * 2012-11-26 2017-04-05 北京奇虎科技有限公司 A kind of browser for realizing web page content review, server and method
US9256593B2 (en) * 2012-11-28 2016-02-09 Wal-Mart Stores, Inc. Identifying product references in user-generated content
US20140156481A1 (en) * 2012-11-30 2014-06-05 Bazaarvoice, Inc. Using a financial account statement to present an opportunity to provide content related to a good or service
CN103870973B (en) * 2012-12-13 2017-12-19 阿里巴巴集团控股有限公司 Information push, searching method and the device of keyword extraction based on electronic information
CN103885994B (en) * 2012-12-24 2018-11-23 腾讯科技(深圳)有限公司 A kind of product control methods and device
US20150073875A1 (en) * 2013-01-30 2015-03-12 Ashfaq Rahman System and method for acquiring, processing and presenting information over the internet
CN103970783A (en) * 2013-01-31 2014-08-06 百度在线网络技术(北京)有限公司 LBS (Location Based Service)-based information acquisition method and equipment
CN103970784A (en) * 2013-01-31 2014-08-06 百度在线网络技术(北京)有限公司 Retrieval method and equipment
US10037409B2 (en) * 2013-02-27 2018-07-31 Mro Corporation Automated method and system for reducing the improper disclosure of documents
US8712907B1 (en) 2013-03-14 2014-04-29 Credibility Corp. Multi-dimensional credibility scoring
US8996391B2 (en) 2013-03-14 2015-03-31 Credibility Corp. Custom score generation system and methods
JP5654073B2 (en) * 2013-03-18 2015-01-14 楽天株式会社 Review management device, review management method, and review management program
US9665584B2 (en) 2013-03-28 2017-05-30 Linkedin Corporation System and method for recommending actions on a social network
CN104133830A (en) * 2013-05-02 2014-11-05 乐视网信息技术(北京)股份有限公司 Data obtaining method
US9615136B1 (en) 2013-05-03 2017-04-04 Amazon Technologies, Inc. Video classification
US9311363B1 (en) * 2013-05-15 2016-04-12 Google Inc. Personalized entity rankings
US10096045B2 (en) * 2013-05-31 2018-10-09 Walmart Apollo, Llc Tying objective ratings to online items
US9245026B1 (en) * 2013-06-26 2016-01-26 Amazon Technologies, Inc. Increasing the relevancy of search results across categories
US20150161686A1 (en) * 2013-07-26 2015-06-11 Kurtis Williams Managing Reviews
JP6147629B2 (en) * 2013-09-24 2017-06-14 Kddi株式会社 Page site server, program, and method for immediately displaying a point of interest for page content
CN103473359B (en) * 2013-09-26 2017-03-22 北京奇虎科技有限公司 Method and device for providing web page open type abstract by engine
US9910925B2 (en) * 2013-11-15 2018-03-06 International Business Machines Corporation Managing searches for information associated with a message
US20150169607A1 (en) * 2013-12-17 2015-06-18 Ebay Inc. Systems and methods to present images representative of searched items
CN103778235A (en) * 2014-01-26 2014-05-07 北京京东尚科信息技术有限公司 Method and device for processing commodity assessment information
US10332127B2 (en) * 2014-01-31 2019-06-25 Walmart Apollo, Llc Trend data aggregation
CN103823893A (en) * 2014-03-11 2014-05-28 北京大学 User comment-based product search method and system
WO2015145525A1 (en) * 2014-03-24 2015-10-01 楽天株式会社 Information processing device, information processing method, and program
US10332161B2 (en) 2014-06-05 2019-06-25 International Business Machines Corporation Retrieving reviews based on user profile information
US20160125511A1 (en) * 2014-06-30 2016-05-05 Ahmed Farouk Shaaban System and method for managing vendors, tax allocation and posting journal entries
CN104111990A (en) * 2014-07-02 2014-10-22 百度在线网络技术(北京)有限公司 Displaying method and device of search result card
CN104123358A (en) * 2014-07-17 2014-10-29 广州金山网络科技有限公司 User comment showing method and system
CN104182780B (en) * 2014-08-21 2018-07-03 五八同城信息技术有限公司 A kind of method and terminal device for automatically generating comment of having dinner
US20160085867A1 (en) * 2014-09-24 2016-03-24 Richard Gomez Method and system for aggregating opinions
US9705972B2 (en) * 2014-10-31 2017-07-11 International Business Machines Corporation Managing a set of data
US9846901B2 (en) * 2014-12-18 2017-12-19 Nuance Communications, Inc. Product recommendation with ontology-linked product review
US10380657B2 (en) 2015-03-04 2019-08-13 International Business Machines Corporation Rapid cognitive mobile application review
US10796319B2 (en) * 2015-04-07 2020-10-06 International Business Machines Corporation Rating aggregation and propagation mechanism for hierarchical services and products
US10223442B2 (en) * 2015-04-09 2019-03-05 Qualtrics, Llc Prioritizing survey text responses
TWI645348B (en) * 2015-05-26 2018-12-21 鴻海精密工業股份有限公司 System and method for automatically summarizing images and comments within commodity-related web articles
CN106294425B (en) * 2015-05-26 2019-11-19 富泰华工业(深圳)有限公司 Method and system for automatic graphic summarization of commodity-related web articles
US10127506B2 (en) 2015-08-07 2018-11-13 International Business Machines Corporation Determining users for limited product deployment based on review histories
US11164223B2 (en) 2015-09-04 2021-11-02 Walmart Apollo, Llc System and method for annotating reviews
US10140646B2 (en) * 2015-09-04 2018-11-27 Walmart Apollo, Llc System and method for analyzing features in product reviews and displaying the results
US20170091832A1 (en) * 2015-09-27 2017-03-30 International Business Machines Corporation Product component driven review aggregation
US20170091838A1 (en) 2015-09-30 2017-03-30 International Business Machines Corporation Product recommendation using sentiment and semantic analysis
US10339160B2 (en) 2015-10-29 2019-07-02 Qualtrics, Llc Organizing survey text responses
CN105488705A (en) * 2015-11-23 2016-04-13 深圳正品创想科技有限公司 Auxiliary system and method of online shopping
CN106934680A (en) * 2015-12-29 2017-07-07 阿里巴巴集团控股有限公司 A kind of method and device for business processing
US10699296B2 (en) * 2015-12-30 2020-06-30 Verizon Patent And Licensing, Inc. Native video advertising with voice-based ad management and machine-to-machine ad bidding
US10607233B2 (en) 2016-01-06 2020-03-31 International Business Machines Corporation Automated review validator
US20170277795A1 (en) * 2016-03-25 2017-09-28 Taylor Christensen Web page search bar
KR102246823B1 (en) 2016-04-15 2021-04-30 이베이 인크. How to Adopt Data Across Different Sites
US10600097B2 (en) 2016-06-30 2020-03-24 Qualtrics, Llc Distributing action items and action item reminders
US11645317B2 (en) 2016-07-26 2023-05-09 Qualtrics, Llc Recommending topic clusters for unstructured text documents
US11416907B2 (en) 2016-08-16 2022-08-16 International Business Machines Corporation Unbiased search and user feedback analytics
US10552497B2 (en) * 2016-08-16 2020-02-04 International Business Machines Corporation Unbiasing search results
US10579625B2 (en) 2016-09-15 2020-03-03 Walmart Apollo, Llc Personalized review snippet generation and display
CN106357801A (en) * 2016-10-24 2017-01-25 安徽百慕文化科技有限公司 Data communication interaction platform based on Internet of Things
CN106528832A (en) * 2016-11-15 2017-03-22 南京明鉴智能科技有限公司 Establishment method for comment indexing database and comment indexing system
US10331436B2 (en) 2017-03-20 2019-06-25 International Business Machines Corporation Smart reviews for applications in application stores
US10789313B2 (en) * 2017-03-30 2020-09-29 Innoplexus Ag Method and system for extracting information related to product
CN107358510A (en) * 2017-07-19 2017-11-17 骆德轩 One kind transaction evaluation method and equipment
CN107861411A (en) * 2017-09-25 2018-03-30 深圳市赛亿科技开发有限公司 The control method and its control system of a kind of intelligent bookshelf
US10416993B2 (en) 2017-10-06 2019-09-17 International Business Machines Corporation Mobile application update manager
JP6906430B2 (en) * 2017-11-20 2021-07-21 ヤフー株式会社 Information processing equipment, information processing methods and information processing programs
CN109934644A (en) * 2017-12-15 2019-06-25 西安比卓电子科技有限公司 A kind of information creating method
CN109933757A (en) * 2017-12-15 2019-06-25 西安比卓电子科技有限公司 A kind of comment inquiry edit methods
CN109934643A (en) * 2017-12-15 2019-06-25 西安比卓电子科技有限公司 A kind of review record method
CN108520441A (en) * 2018-04-04 2018-09-11 网易无尾熊(杭州)科技有限公司 Data processing method, medium, system and computing device
US20190385199A1 (en) * 2018-06-18 2019-12-19 International Business Machines Corporation Review and recommendation filtering based on user fitness metric
CN110008395B (en) * 2018-09-17 2021-11-02 北京字节跳动网络技术有限公司 Comment content presentation method and device, storage medium and terminal
US20200143437A1 (en) * 2018-11-06 2020-05-07 Luke Luechtefeld Systems and methods for individualized restaurant rating
CN111259271B (en) * 2018-12-03 2023-04-25 阿里巴巴集团控股有限公司 Comment information display method and device, electronic equipment and computer readable medium
US11107092B2 (en) * 2019-01-18 2021-08-31 Sprinklr, Inc. Content insight system
CN110134763B (en) * 2019-04-26 2021-06-29 杭州飞弛网络科技有限公司 Method and system for classified display of evaluation information of strangers' social activities
US11373220B2 (en) * 2019-05-07 2022-06-28 Capital One Services, Llc Facilitating responding to multiple product or service reviews associated with multiple sources
US11715134B2 (en) 2019-06-04 2023-08-01 Sprinklr, Inc. Content compliance system
US11144730B2 (en) 2019-08-08 2021-10-12 Sprinklr, Inc. Modeling end to end dialogues using intent oriented decoding
CN110837611B (en) * 2019-09-16 2023-03-24 珠海格力电器股份有限公司 Method, device and equipment for acquiring evaluation information and storage medium
US12159301B2 (en) 2020-02-26 2024-12-03 International Business Machines Corporation Text processing based interface accelerating
CN111753160A (en) * 2020-05-21 2020-10-09 张家昊 Online shopping after-sale evaluation system
US11568004B2 (en) * 2020-07-14 2023-01-31 Google Llc Comparative search within user-generated content
CN112131346B (en) * 2020-09-25 2024-04-30 北京达佳互联信息技术有限公司 Comment aggregation method and device, storage medium and electronic equipment
KR102340807B1 (en) * 2021-03-30 2021-12-20 쿠팡 주식회사 Operating method for electronic apparatus for offering item information and electronic apparatus supporting thereof
CN113157970A (en) * 2021-04-26 2021-07-23 北京字节跳动网络技术有限公司 Information display method and device and computer storage medium
JP2023014409A (en) * 2021-07-16 2023-01-27 Lazuli株式会社 Product name identification system, name identification product master generation method, and program
KR20240177105A (en) * 2023-06-19 2024-12-27 쿠팡 주식회사 Method, apparatus, and recording medium for processing data of electronic commerce service
US20250094728A1 (en) * 2023-09-15 2025-03-20 Shopify Inc. Summary of reviews generated by a generative language model

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6405175B1 (en) * 1999-07-27 2002-06-11 David Way Ng Shopping scouts web site for rewarding customer referrals on product and price information with rewards scaled by the number of shoppers using the information
US20070294127A1 (en) * 2004-08-05 2007-12-20 Viewscore Ltd System and method for ranking and recommending products or services by parsing natural-language text and converting it into numerical scores

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5241671C1 (en) * 1989-10-26 2002-07-02 Encyclopaedia Britannica Educa Multimedia search system using a plurality of entry path means which indicate interrelatedness of information
US6978277B2 (en) * 1989-10-26 2005-12-20 Encyclopaedia Britannica, Inc. Multimedia search system
US5339257A (en) * 1991-05-15 1994-08-16 Automated Technology Associates Inc. Real-time statistical process monitoring system
US7082426B2 (en) * 1993-06-18 2006-07-25 Cnet Networks, Inc. Content aggregation method and apparatus for an on-line product catalog
WO1995012173A2 (en) 1993-10-28 1995-05-04 Teltech Resource Network Corporation Database search summary with user determined characteristics
US6029195A (en) * 1994-11-29 2000-02-22 Herz; Frederick S. M. System for customized electronic identification of desirable objects
US5855015A (en) 1995-03-20 1998-12-29 Interval Research Corporation System and method for retrieval of hyperlinked information resources
US5768142A (en) * 1995-05-31 1998-06-16 American Greetings Corporation Method and apparatus for storing and selectively retrieving product data based on embedded expert suitability ratings
US5983220A (en) * 1995-11-15 1999-11-09 Bizrate.Com Supporting intuitive decision in complex multi-attributive domains using fuzzy, hierarchical expert models
US5842206A (en) 1996-08-20 1998-11-24 Iconovex Corporation Computerized method and system for qualified searching of electronically stored documents
DE50001172D1 (en) * 1999-04-01 2003-03-06 Oce Printing Systems Gmbh DEVICE AND METHOD FOR PROCESSING AND PRINTING INFORMATION
US6957186B1 (en) 1999-05-27 2005-10-18 Accenture Llp System method and article of manufacture for building, managing, and supporting various components of a system
US8271336B2 (en) 1999-11-22 2012-09-18 Accenture Global Services Gmbh Increased visibility during order management in a network-based supply chain environment
WO2001046887A1 (en) * 1999-12-23 2001-06-28 My-E-Surveys.Com, Llc System and methods for internet commerce and communication based on customer interaction and preferences
US6963848B1 (en) 2000-03-02 2005-11-08 Amazon.Com, Inc. Methods and system of obtaining consumer reviews
US20010044837A1 (en) * 2000-03-30 2001-11-22 Iqbal Talib Methods and systems for searching an information directory
US20020052783A1 (en) * 2000-06-26 2002-05-02 Turek David G. Method and apparatus for establishing a customized electronic site
US6691112B1 (en) * 2000-08-01 2004-02-10 Darren Michael Siegal Method for indexing and managing a searchable community of non-HTML information
US7277926B1 (en) 2000-09-28 2007-10-02 International Business Machines Corporation Business method and user interface for representing business analysis information side-by-side with product pages of an online store
WO2002056144A2 (en) * 2001-01-09 2002-07-18 Topcoder Inc Systems and methods for coding competitions
US20020152114A1 (en) * 2001-04-12 2002-10-17 Shumaker Lance C. System and method for updating an intranet portal
US6991112B2 (en) * 2002-03-05 2006-01-31 Romanow Enterprises, Inc. Disposable filter cartridge
US20030195754A1 (en) 2002-04-10 2003-10-16 Ford John E. Product deviation request tracking system
US7346549B2 (en) * 2002-06-27 2008-03-18 At&T Knowledge Ventures, L.P. System and method for wirelessly transacting access to a set of events and associated digital content/products
CN1395206A (en) 2002-08-23 2003-02-05 北京大学 A method and system for collecting, analyzing and providing online information
WO2004038560A2 (en) * 2002-10-23 2004-05-06 Capital Surini Group International Systems and methods for clinical trials information management
WO2004070507A2 (en) 2003-02-07 2004-08-19 Power Measurement Ltd. A method and system for calculating and distributing utility costs
US7240049B2 (en) * 2003-11-12 2007-07-03 Yahoo! Inc. Systems and methods for search query processing using trend analysis
US20050154769A1 (en) 2004-01-13 2005-07-14 Llumen, Inc. Systems and methods for benchmarking business performance data against aggregated business performance data
US20050197893A1 (en) 2004-02-24 2005-09-08 Michael Landau Coupon, price-comparison, and product-review information toolbar for use with a network browser or system/application interface

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6405175B1 (en) * 1999-07-27 2002-06-11 David Way Ng Shopping scouts web site for rewarding customer referrals on product and price information with rewards scaled by the number of shoppers using the information
US20070294127A1 (en) * 2004-08-05 2007-12-20 Viewscore Ltd System and method for ranking and recommending products or services by parsing natural-language text and converting it into numerical scores

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Rotten Tomatoes (RT) (http://www.rottentomatoes.com/m/BadBoysII-1123937/reviews.php as of 8/11/2003 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060277290A1 (en) * 2005-06-02 2006-12-07 Sam Shank Compiling and filtering user ratings of products
US11010828B2 (en) 2011-06-22 2021-05-18 Rakuten, Inc. Information processing apparatus, information processing method, information processing program, recording medium having stored therein information processing program
US20130254181A1 (en) * 2012-01-03 2013-09-26 Be Labs, Llc Aggregation and Categorization
US20150178713A1 (en) * 2013-12-23 2015-06-25 Mastercard International Incorporated Method and system of providing financial transaction card related mobile apps
US9646297B2 (en) * 2013-12-23 2017-05-09 Mastercard International Incorporated Method and system of providing financial transaction card related mobile apps
US10325274B2 (en) * 2014-01-31 2019-06-18 Walmart Apollo, Llc Trend data counter
US20150264093A1 (en) * 2014-03-14 2015-09-17 ResearchGate Corporation Publication review user interface and system
US10389767B2 (en) * 2014-03-14 2019-08-20 Researchgate Gmbh Publication review user interface and system
US11611596B2 (en) * 2014-03-14 2023-03-21 Researchgate Gmbh Publication review user interface and system
US20160283497A1 (en) * 2015-03-27 2016-09-29 Ca, Inc. Analysis of mobile application reviews based on content, reviewer credibility, and temporal and geographic clustering
US10423651B2 (en) * 2015-03-27 2019-09-24 Ca, Inc. Analysis of mobile application reviews based on content, reviewer credibility, and temporal and geographic clustering

Also Published As

Publication number Publication date
US20060129446A1 (en) 2006-06-15
US20120158735A1 (en) 2012-06-21
US7962461B2 (en) 2011-06-14
CN101151607B (en) 2012-11-14
US20110208669A1 (en) 2011-08-25
US8380694B2 (en) 2013-02-19
CN101151607A (en) 2008-03-26
US8161030B2 (en) 2012-04-17

Similar Documents

Publication Publication Date Title
US8161030B2 (en) Method and system for aggregating reviews and searching within reviews for a product
US20060143158A1 (en) Method, system and graphical user interface for providing reviews for a product
CN104361111B (en) A kind of archives are compiled and grind method automatically
US9672283B2 (en) Structured and social data aggregator
US8041601B2 (en) System and method for automatically targeting web-based advertisements
JP5300960B2 (en) Adding attributes and labels to structured data
US8126868B1 (en) Search rankings with dynamically customized content
US8775396B2 (en) Method and system for searching a wide area network
US20010049674A1 (en) Methods and systems for enabling efficient employment recruiting
CN102859518B (en) Information processing device, information processing method
CN105765573B (en) Improvements in website traffic optimization
US20070294240A1 (en) Intent based search
US20100030647A1 (en) Advertisement selection for internet search and content pages
CN101233513A (en) Systems and methods for reordering result sets
US20050267872A1 (en) System and method for automated mapping of items to documents
US20070136256A1 (en) Method and apparatus for representing text using search engine, document collection, and hierarchal taxonomy
US20130086052A1 (en) Method and System of Information Matching in Electronic Commerce Website
WO2001044992A9 (en) Context matching system and method
KR20080066818A (en) How to Retrieve Structured Data
JP5985543B2 (en) Information aggregation device, information aggregation method, and information aggregation program
EP1834249A2 (en) Method, system and graphical user interface for providing reviews for a product
WO2002041182A1 (en) Interesting news item distributing system and interesting news item distributing method
TWI616761B (en) Information matching method and system applied to e-commerce website
JP2017168154A (en) Information aggregation device, information aggregation method, and information aggregation program
JP6171061B2 (en) Information aggregation device, information aggregation method, and information aggregation program

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION