US20110307512A1 - Disambiguation with respect to multi-grained dimension coordinates - Google Patents

Disambiguation with respect to multi-grained dimension coordinates

Info

Publication number
US20110307512A1
Authority
US
United States
Prior art keywords
grain
value
dimension
coarser
finer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/217,206
Inventor
Todd O. Dampier
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Merced Systems Inc
Original Assignee
Merced Systems Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Merced Systems Inc
Priority to US13/217,206
Publication of US20110307512A1

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2477 Temporal data queries
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2455 Query execution
    • G06F16/24553 Query execution of query operations
    • G06F16/24554 Unary operations; Data partitioning operations
    • G06F16/24556 Aggregation; Duplicate elimination

Definitions

  • the present invention is in the field of processing a report query to a dimensionally-modeled fact collection (i.e., facts of or derived from a collection of facts organized as, or otherwise accessible according to, a dimensional data model).
  • the present invention relates to reporting on facts considering the phenomenon in which a grain value at a second grain (“finer grain value” at a “finer grain”) of one or more dimension coordinates satisfying a dimension coordinate constraint of the report query may also be the finer grain value for other dimension coordinates that also satisfy the dimension coordinate constraint but that have a different grain value at a first grain that is coarser than the finer grain (“coarser grain value” at a “coarser grain”).
  • the terminology of dimension coordinates, grains and grain values is discussed below.
  • a coordinate of a LOCATION dimension comprises the following grains: CONTINENT, COUNTRY and CITY.
  • the order of the grains may have some hierarchical significance.
  • the grains are generally ordered such that finer grains are hierarchically “nested” inside coarser grains.
  • the CITY grain may be finer than the COUNTRY grain
  • the COUNTRY grain may be finer than the CONTINENT grain.
  • where the order of the grains of a dimension has hierarchical significance, the value of a coordinate of that dimension, at a particular finer grain, is nominally such that the value of the coordinate of that dimension has only one value at any coarser grain for that dimension.
  • a value of a coordinate of a LOCATION dimension may be specified at the CITY grain of the LOCATION dimension by the value “Los Angeles.” This same coordinate has only one value at the COUNTRY and CONTINENT grains: “United States” and “North America,” respectively.
  • Processing a report query to a dimensional data model includes processing a plurality of dimension coordinates that exist within the dimensional data model.
  • Each of the plurality of dimension coordinates has a second particular grain (“finer grain”) that is finer than a first particular grain (“coarser grain”).
  • the report query specifies a dimension coordinate constraint to which the plurality of dimension coordinates correspond.
  • a subset of the plurality of dimension coordinates are dimension coordinates for which there is ambiguity as to what coarser grain value to associate with the finer grain value. That is, for the subset of the plurality of dimension coordinates, each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain (“finer grain value”) that is the same as the finer grain value of that dimension coordinate, and the at least one other dimension coordinate also has a value at the coarser grain (“coarser grain value”) that is different from the coarser grain value of that dimension coordinate.
  • For every unique finer grain value of the dimension coordinates of the subset, it is determined what coarser grain value to associate with all dimension coordinates of the subset having that finer grain value.
  • the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value.
  • for each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate.
  • a report is generated in view of the plurality of dimension coordinates and their associated coarser grain values.
  • FIG. 1 graphically illustrates a simple situation in which there is only one finer grain value for which there is an ambiguity and, further, the ambiguity is between only two possible coarser grain values.
  • FIG. 2 graphically illustrates an example in which, similar to the FIG. 1 example, disambiguation separately occurs with respect to disambiguation time chambers for each time reporting label of the time reporting range of the report query.
  • FIG. 3 graphically illustrates an example in which a disambiguation occurs for a disambiguation time chamber that spans more than one time reporting label.
  • FIG. 4 is a block diagram illustrating an example architecture of a system in which reporting of facts of a dimensionally-modeled fact collection may be performed, including disambiguating as desired or as otherwise determined to be appropriate.
  • FIG. 5 is a flowchart illustrating an example of multiple-pass processing including disambiguation.
  • the inventors have realized that it is desirable to consider the phenomenon in which, for a subset of a plurality of dimension coordinates that satisfy a report query, there are dimension coordinates of the subset that have the same grain value at a finer grain but a different grain value at a coarser grain. In this case, when performing operations with respect to dimension coordinates of this subset, there is ambiguity as to what coarser grain value to associate with the finer grain value.
  • This phenomenon may arise, for example, when one or more dimensions in which the dimension coordinates exist is a slowly changing dimension. This is a phenomenon in which the relationship of grains for a dimension may change over time. While it may be contrived to consider the concept of slowly changing dimensions with reference to the example LOCATION dimension (since, generally, the relationship of CONTINENT, COUNTRY and CITY grains will not change over time), there are other more realistic examples of this phenomenon.
  • the EMPLOYEE dimension comprises the following grains: ORGANIZATION, DIVISION, TEAM and PERSON.
  • values of coordinates at various grains may change as a person moves from one team to another team (or, perhaps, a team moves from one division to another division). For example, at the beginning of one quarter, Bill worked on the Red Team; sometime during the quarter, Bill moved to the Blue Team.
  • This may be modeled by one EMPLOYEE dimension coordinate having the value “Bill” at grain PERSON and the value “Red Team” at grain TEAM, plus a second EMPLOYEE dimension coordinate also having the value “Bill” at grain PERSON but the value “Blue Team” at grain TEAM. It is also possible to encode in the representation of the dimension coordinates the specific time intervals during which these grain relationships obtained.
  • the ambiguity about Bill's team membership during the Q4 2005 time period can be arbitrarily disambiguated. For example, all of Bill's cookie eating metric values for the Q4 2005 time period could be attributed to the Red Team, even metric values for cookies eaten by Bill while Bill was on the Blue Team:
  • if dimension coordinates having a value of Bill at the PERSON grain are to be treated as having a value of Red Team at the TEAM grain, then even the dimension coordinate having a value of Bill at the PERSON grain and having a value of Blue Team at the TEAM grain will be treated as having a value of Red Team at the TEAM grain.
  • each dimension coordinate of the subset is such that there is at least one other dimension coordinate of the subset having a finer grain value that is the same as the finer grain value of that dimension coordinate (e.g., Bill at the PERSON grain) and the at least one other dimension coordinate also has a coarser grain value that is different from the coarser grain value of that dimension coordinate (e.g., another dimension coordinate has a value of Red Team at the TEAM grain and that dimension coordinate has a value of Blue Team at the TEAM grain).
  • the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value is considered to be the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value (e.g., the coarser grain value to associate with the finer grain value of Bill is considered to be either Red Team or Blue Team).
  • FIG. 1 illustrates this aspect graphically.
  • the PERSON grain is the finer grain and the TEAM grain is the coarser grain.
  • the dimension coordinate 102 and the dimension coordinate 104 are considered to be dimension coordinates of a “subset.”
  • each dimension coordinate of a subset is such that there is at least one other dimension coordinate of the subset having a finer grain value that is the same as the finer grain value of that dimension coordinate and the at least one other dimension coordinate also has a coarser grain value that is different from the coarser grain value of that dimension coordinate.
  • the dimension coordinate 102 and the dimension coordinate 104 each have the value Bill at the PERSON grain (finer grain), but the dimension coordinate 102 and the dimension coordinate 104 have different values at the TEAM grain. That is, the dimension coordinate 102 has the value Blue Team at the TEAM grain, and the dimension coordinate 104 has the value Red Team at the TEAM grain.
  • Some mechanism has been used to determine and process the time period by which the dimension coordinates 102 and 104 are characterized and, thus, to associate each of the dimension coordinates 102 and 104 (and, perhaps, one or more dimension coordinates that are not shown, for which there is no ambiguity as to what coarser grain value to associate with the finer grain values) with a particular time reporting label.
  • the (one and only) particular time reporting label is Q4 2005.
  • block 106 represents the coarser grain value of Blue Team
  • block 108 represents the coarser grain value of Red Team.
  • Blue Team and Red Team are each a possible coarser grain value to associate with the finer grain value of Bill, which is the finer grain value at the PERSON grain of both the dimension coordinate 102 and the dimension coordinate 104 .
  • the “switch” 110 graphically represents a result of a disambiguation determination 112 as to which of the Blue Team value and the Red Team value is to be associated with the finer grain value of Bill, at the PERSON grain.
  • the switch 110 is figuratively positioned such that the Blue Team value 106 is associated with the value of Bill at the PERSON grain for the dimension coordinate 102 and the dimension coordinate 104 , even though the dimension coordinate 104 has an actual value of Red Team at the TEAM grain. Referring to the examples above—computing the average number of cookies eaten by each team's members for the Q4 2005 time period—this would result in processing the dimension coordinates as set forth with respect to Result 1-3 above.
  • the switch 110 is figuratively positioned such that the Red Team value is associated with the value of Bill at the PERSON grain for both the dimension coordinate 102 and the dimension coordinate 104 , even though the dimension coordinate 102 has an actual value of Blue Team at the TEAM grain.
  • FIG. 1 represents a simple situation in which, for a particular subset of dimension coordinates, there is only one finer grain value for which there is an ambiguity as to an associated coarser grain value and, further, the ambiguity is between only two possible coarser grain values.
  • an ambiguity may be between more than two possible coarser grain values. Where an ambiguity is between more than two possible coarser grain values, the disambiguation results in a single one of the possible coarser grain values being associated with a particular finer grain value.
  • FIG. 1 illustrates an example where not only the dimension coordinates of the subset, but also the dimension coordinates of the larger group to which the subset belongs, have all been determined to be associated with a single particular time period.
  • the example is one in which each of the dimension coordinates considered for disambiguation corresponds to the time period of the single Q4 2005 time reporting label.
  • Unlike the FIG. 1 example, FIG. 2 exhibits an example for which there is more than a single time reporting label. That is, with respect to FIG. 2, a report is on the average number of cookies eaten by each team's members during Q4 2005, the time reporting range, reported on a monthly basis. The number of time reporting labels is three: OCT-2005, NOV-2005 and DEC-2005.
  • each time reporting label corresponds to a separate non-overlapping time period, namely, the time periods associated with the OCT-2005, NOV-2005 and DEC-2005 time periods.
  • each dimension coordinate satisfying the dimension coordinate constraint of a report query is associated with one of the separate non-overlapping time periods which we call “disambiguation time chambers.”
  • Each disambiguation time chamber corresponds to a different non-overlapping time period of the time reporting range, and the subsets for which there is disambiguation exist on a disambiguation time chamber by disambiguation time chamber basis, based on a correspondence between a time period with which a dimension coordinate is associated and a time period associated with a disambiguation time chamber.
  • FIG. 2 illustrates a simple example, in which the disambiguation time chambers are at the same resolution as the time reporting labels and, thus, the disambiguation time chambers coincide with the time reporting labels. Since the disambiguation time chambers coincide with time reporting labels in the FIG. 2 example, not only do the subsets exist on a disambiguation time chamber by disambiguation time chamber basis, it also follows that the subsets exist on a time reporting label by time reporting label basis. In the FIG. 2 example, there may be a subset for which there is disambiguation for each of the OCT-2005, NOV-2005 and DEC-2005 time periods. By contrast, we explain later with respect to the FIG.
  • each disambiguation time chamber may simultaneously correspond to two or more time reporting labels.
  • a particular dimension coordinate may be associated with more than one of the time periods with which time reporting labels are associated. We will note an example of this with reference to Table 2, later in this description.
  • a time period to which each separate set of dimension coordinates corresponds is defined by a time period to which one or more of the time reporting labels corresponds.
  • there is one disambiguation time chamber and it corresponds to the Q4 2005 time period.
  • there are three disambiguation time chambers and the three disambiguation time chambers correspond to the OCT-2005, NOV-2005, and DEC-2005, time periods, respectively.
  • not only may a disambiguation time chamber be defined by the time period to which one of the time reporting labels corresponds but, also, a disambiguation time chamber may be defined by the time period to which more than one of the time reporting labels collectively correspond (or, put another way, a disambiguation time chamber may correspond to one or more time reporting labels).
  • the disambiguation may be among two or more coarser grain values (e.g., the disambiguation may be among Red Team and Blue Team, or the disambiguation may be among Red Team, Blue Team and Green Team).
  • an example of such a dimension coordinate includes the dimension coordinate having the value Mary at the PERSON grain and having the value Red Team at the TEAM grain. This dimension coordinate is associated with all of the following time reporting labels: OCT-2005, NOV-2005 and DEC-2005.
  • the cookie eating metric values could be left “attached” to both the PERSONs and TEAMs to which it accrued (i.e., no disambiguation), and an average per team, per each time reporting label, could be computed as:
  • the TEAM value of Red Team could be attributed to Bill for the NOV-2005 time reporting label.
  • dimension coordinates associated with the NOV-2005 disambiguation time chamber are the only dimension coordinates for which an ambiguity exists as to coarser grain values associated with particular finer grain values.
  • the disambiguation for the dimension coordinates associated with the disambiguation time chamber defined by the NOV-2005 time period could result in the TEAM value of Blue Team being attributed to Bill for the NOV-2005 time reporting label.
  • the disambiguation time chambers each correspond to a separate single respective time reporting label
  • the disambiguation time chambers each correspond to more than one time reporting label.
  • the reporting labels may be at a month resolution (e.g., JAN-2005, FEB-2005, . . . , NOV-2005 and DEC-2005) of the time dimension.
  • the disambiguation time chambers may, on the other hand, be at a quarter resolution (e.g., Q1 2005, Q2 2005, Q3 2005 and Q4 2005). In other words, all the dimension coordinates characterized by a time period that corresponds to any time reporting label for a month in a particular quarter would be associated with that particular quarter for disambiguation purposes.
  • FIG. 3 illustrates such an example.
  • dimension coordinates are shown that are associated with time periods corresponding to the time reporting labels JAN-2005 ( 302 ), FEB-2005 ( 304 ), MAR-2005 ( 306 ), APR-2005 ( 312 ), MAY-2005 ( 314 ) and JUN-2005 ( 316 ).
  • Dimension coordinates associated with time periods corresponding to the remaining time reporting labels for 2005 are not shown, to simplify the illustration.
  • the disambiguation time chambers would be defined by each quarter-year time period of the whole year, for a total of four disambiguation time chambers. Again, there are four disambiguation time chambers for the 2005 time period, even though there are twelve time reporting labels for the 2005 time period. That is, the disambiguation decision for each PERSON grain entity could thus be made up to four times, once for each quarter (e.g., determinations 308 and 318 for the first and second quarter, respectively), even though the report processing is carried out twelve times, once for each “month” time reporting label.
  • Red Team's bonus budget is $3k
  • Green Team's bonus budget is $5k
  • Blue Team's bonus budget is $7k.
  • various criteria may be considered, with varying results as to the coarser grain value (value at TEAM grain) to associate with the finer grain value of Alice (value at PERSON grain).
  • if the criterion equals “latest team,” then the result is Blue Team. If the criterion equals “earliest team,” then the result equals Red Team. It is noted that the “latest team” and “earliest team” criteria are time based. Other criteria may include, for example, “longest team membership during time of disambiguation time chamber.” For this criterion, the result equals “Green Team.” For the criterion of “highest bonus budget,” the result equals Blue Team. For the criterion of “team on which she ate the most cookies,” the result equals “Green Team.”
  • processing for generating the report may include associating metric values in a “copying down” or “rolling up” direction.
  • Rolling up includes associating, with a coarser grain value, a metric value that corresponds to a finer grain value (i.e., rolling up from finer to coarser) with which that coarser grain value is associated.
  • Determining, for each month, the average number of cookies per person (metric value that corresponds to a finer grain value—at PERSON grain) eaten by each of Red Team and Blue Team (coarser grain value—at TEAM grain) is an example of rolling up.
  • “rolling up” includes associating in a many (finer grain values) to one (coarser grain value) manner.
  • Copying down includes associating, with finer grain values, a metric value that corresponds to a coarser grain value (i.e., copying down from coarser to finer).
  • An example of copying down includes, for each month, associating the team goal (coarser grain value—at TEAM grain) for every person (finer grain value—at PERSON grain).
  • copying down includes associating in a one (coarser grain value) to many (finer grain value) manner.
  • the disambiguation is useful for resolving what coarser grain value is to be associated with finer grain values, for which the association may otherwise be ambiguous.
  • FIG. 4 is a block diagram illustrating an example architecture of a system 400 in which reporting of facts of a dimensionally-modeled fact collection may be performed, including disambiguating as desired or as otherwise determined to be appropriate.
  • a user 402 may cause a report query 404 to be provided to a fact collection query generator 406 .
  • the user 402 may interact with a web page via a web browser, where the web page is served by a report user interface using, for example, a Java Server Page mechanism.
  • the user 402 interacts with the web page such that the report query 404 is provided to the fact collection query generator 406 .
  • the report query 404 includes a dimension coordinate constraint, which may be one or more dimension coordinate constraints.
  • a dimension coordinate constraint for a dimension of the dimensionally-modeled fact collection specifies coordinates of that dimension of the dimensionally-modeled fact collection.
  • a dimension coordinate constraint may specify coordinates of that dimension of the dimensionally-modeled fact collection by specifying a value of the dimension at a particular grain.
  • Dimension coordinate constraints of the report query 404 then, specify a plurality of coordinates of one or more dimensions of the dimensionally-modeled fact collection, on which a report is to be based.
  • the fact collection query generator 406 processes the report query 404 to generate an appropriate corresponding fact collection query 408 , which is presented to the dimensionally-modeled fact collection 410 .
  • a result 416 of presenting the fact collection query 408 to the dimensionally-modeled fact collection 410 is processed by a report generator 418 to generate a report corresponding to the report query 404 caused to be provided by the user 402 .
  • the generated report includes an indication of processing with respect to dimensional members as appropriate in view of the dimension coordinate constraints of the report query 404 .
  • the dimensionally-modeled fact collection 410 is implemented as a relational database, storing fact data in a manner that is accessible to users according to a ROLAP—Relational Online Analytical Processing—schema (fact and dimension tables).
  • the fact collection query 408 may originate as a database query in one form that is processed into another form; for example, it may be processed by an OLAP query engine into a fact collection query 408 presented as an SQL query that is understandable by the underlying relational database. This is just one example, however, and there are many other ways of representing and accessing a dimensionally-modeled fact collection.
  • Processing 418 is applied to the fact collection result 416 to generate a report.
  • the generated report includes an indication of dimension members and facts corresponding to those indicated dimension members. What facts are reported may depend, at least in part, on disambiguation of what coarser grain value is determined to correspond to particular finer grain values.
  • the composition of the generated report may be accomplished by the fact collection query generator 406 particularly generating the fact collection query 408 in accordance with the report query, by the result processing 418 particularly processing the fact collection result (e.g., by applying filtering) in accordance with the report query, or by a combination of both.
  • the report query 404 may include a disambiguation criterion that may be provided, for example, via a user interface.
  • the manner in which the facts corresponding to those indicated dimension members are reported may be according to a default mode or according to a preconfigured mode.
  • the fact collection query generator 406 and/or the result processing 418 operate according to the default, preconfigured or designated mode.
  • a multiple-pass processing is utilized.
  • An example of the multiple-pass processing is illustrated by the flowchart of FIG. 5 .
  • every dimension coordinate (which may be one or more dimension coordinates) that satisfies the dimension coordinate constraint is determined.
  • the determined dimension coordinates have a particular value at a particular grain.
  • at step 506, for every unique finer grain value of the dimension coordinates of the subset, it is determined what coarser grain value to associate with all dimension coordinates of the subset having that finer grain value.
  • the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value.
  • for each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate.
  • Steps 504 and 506 may be repeated for additional disambiguation time chambers. That is, the subset would be determined for each additional disambiguation time chamber and disambiguation carried out as appropriate on that subset.
  • a report is generated in view of the plurality of dimension coordinates and their associated coarser grain values.
  • a subset of a plurality of dimension coordinates satisfying a dimension coordinate constraint of a report query are such that, for each dimension coordinate of the subset, there is at least one other dimension coordinate of the subset having a finer grain value that is the same as the finer grain value of that dimension coordinate and the at least one other dimension coordinate also has a coarser grain value that is different from the coarser grain value of that dimension coordinate.
  • disambiguation may be carried out such that, for every unique finer grain value of the dimension coordinates of the subset, the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value is determined, wherein the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value.
  • a report may be generated in view of the plurality of dimension coordinates and their associated coarser grain values.
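
The interplay of disambiguation time chambers, disambiguation criteria and "copying down" described in the list above can be illustrated with a small sketch. Everything named here is an assumption made for illustration only: the `Coord` record, the integer month encoding, the `quarter_of` chamber mapping, the `latest_team` criterion and the sample data (including the team goal values) are hypothetical and are not taken from the application.

```python
from collections import defaultdict
from typing import Callable, NamedTuple


class Coord(NamedTuple):
    """A dimension coordinate tagged with the month it is associated with."""
    person: str  # finer grain value (PERSON grain)
    team: str    # coarser grain value (TEAM grain)
    month: int   # hypothetical encoding of a 2005 time reporting label, 1..12


def quarter_of(coord: Coord) -> int:
    """Quarter-resolution disambiguation time chamber for a coordinate."""
    return (coord.month - 1) // 3 + 1


def latest_team(candidates: list[Coord]) -> str:
    """Illustrative time-based criterion: the team held in the latest month."""
    return max(candidates, key=lambda c: c.month).team


def disambiguate(
    coords: list[Coord],
    chamber_of: Callable[[Coord], int] = quarter_of,
    choose: Callable[[list[Coord]], str] = latest_team,
) -> dict[tuple[int, str], str]:
    """Map each (chamber, person) pair to exactly one team."""
    groups: dict[tuple[int, str], list[Coord]] = defaultdict(list)
    for c in coords:
        groups[(chamber_of(c), c.person)].append(c)
    resolved = {}
    for key, members in groups.items():
        teams = {m.team for m in members}
        # Unambiguous coordinates keep their own coarser grain value;
        # ambiguous ones get one of their existing values, picked by `choose`.
        resolved[key] = members[0].team if len(teams) == 1 else choose(members)
    return resolved


def copy_down(resolved, team_goal):
    """Copy a team-level metric down to every (chamber, person) pair."""
    return {key: team_goal[team] for key, team in resolved.items()}


# Hypothetical data: Bill moves from Red Team to Blue Team during Q4.
coords = [
    Coord("Bill", "Red Team", 9),
    Coord("Bill", "Red Team", 10),
    Coord("Bill", "Blue Team", 12),
    Coord("Mary", "Red Team", 11),
]
by_quarter = disambiguate(coords)                             # quarter chambers
by_month = disambiguate(coords, chamber_of=lambda c: c.month)  # month chambers
goals = copy_down(by_quarter, {"Red Team": 100, "Blue Team": 120})
```

With quarter chambers, Bill is ambiguous within Q4 and the illustrative criterion resolves him to a single team for the whole quarter. With month chambers, each month holds only one team value for Bill, so nothing needs to be resolved. Passing a different `choose` function would model other criteria, such as "earliest team".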

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Fuzzy Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Each of a plurality of dimension coordinates corresponding to a report query has a finer grain and a coarser grain. A subset of the dimension coordinates are dimension coordinates for which there is ambiguity as to what coarser grain value should be associated with the finer grain value. For every unique finer grain value of the dimension coordinates of the subset, it is determined what coarser grain value to associate with all dimension coordinates of the subset having that finer grain value. The determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value. For each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate. A report is generated in view of the plurality of dimension coordinates and their associated coarser grain values.
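
As a rough illustration of the steps summarized in the abstract, the following sketch finds the ambiguous subset, associates one coarser grain value with each finer grain value, and rolls a metric up into a per-team average. It is a minimal sketch under stated assumptions, not the application's implementation: the function names, the `(finer, coarser)` tuple representation, the `first_seen` chooser, the coordinate for Joe and the cookie counts are all hypothetical.

```python
from collections import defaultdict


def ambiguous_subset(coords):
    """Coordinates whose finer grain value occurs with several coarser values."""
    coarser_by_finer = defaultdict(set)
    for finer, coarser in coords:
        coarser_by_finer[finer].add(coarser)
    return [(f, c) for f, c in coords if len(coarser_by_finer[f]) > 1]


def associate(coords, choose):
    """Return coords with one coarser grain value per finer grain value."""
    candidates = defaultdict(list)
    for finer, coarser in ambiguous_subset(coords):
        candidates[finer].append(coarser)
    chosen = {}
    for finer, values in candidates.items():
        pick = choose(finer, values)
        # The pick must be the coarser value of some coordinate in the subset.
        if pick not in values:
            raise ValueError(f"{pick!r} is not a coarser value of {finer!r}")
        chosen[finer] = pick
    # Coordinates outside the subset keep their own coarser grain value.
    return [(finer, chosen.get(finer, coarser)) for finer, coarser in coords]


def average_per_team(coords, metric, choose):
    """Roll a per-coordinate metric up to the associated coarser grain value."""
    per_person = defaultdict(float)
    for (person, team), value in zip(associate(coords, choose), metric):
        per_person[(team, person)] += value
    totals, headcount = defaultdict(float), defaultdict(int)
    for (team, _person), value in per_person.items():
        totals[team] += value
        headcount[team] += 1
    return {team: totals[team] / headcount[team] for team in totals}


def first_seen(finer, values):
    """Hypothetical criterion: keep the first coarser value encountered."""
    return values[0]


# Hypothetical coordinates (PERSON, TEAM) and cookie counts, one per coordinate.
coords = [("Bill", "Red Team"), ("Bill", "Blue Team"),
          ("Mary", "Red Team"), ("Joe", "Blue Team")]
cookies = [4, 2, 6, 8]

subset = ambiguous_subset(coords)
associated = associate(coords, first_seen)
report = average_per_team(coords, cookies, first_seen)
```

Here only the two coordinates for Bill form the ambiguous subset, so all of Bill's cookies are attributed to whichever team the chooser returns, while the coordinates for Mary and Joe are left as they are.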

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of prior, co-pending U.S. patent application Ser. No. 11/615,694, filed on Dec. 22, 2006, which is incorporated herein by reference in its entirety for all purposes.
  • BACKGROUND
  • The present invention is in the field of processing a report query to a dimensionally-modeled fact collection (i.e., facts of or derived from a collection of facts organized as, or otherwise accessible according to, a dimensional data model). In particular, the present invention relates to reporting on facts considering the phenomenon in which a grain value at a second grain (“finer grain value” at a “finer grain”) of one or more dimension coordinates satisfying a dimension coordinate constraint of the report query may also be the finer grain value for other dimension coordinates that also satisfy the dimension coordinate constraint but that have a different grain value at a first grain that is coarser than the finer grain (“coarser grain value” at a “coarser grain”). For readers not familiar with this terminology, the terminology of dimension coordinates, grains and grain values (including the properties of fineness and coarseness) is discussed below.
  • More particularly, it is known to respond to a report query to a dimensionally-modeled fact collection (facts organized in an n-dimensional data space) by performing operations with respect to dimension coordinates that satisfy a dimension coordinate constraint of the report query. Locations in an n-dimensional data space are specified by n-tuples of coordinates, where each member of the n-tuple corresponds to one of the n dimensions. For example, (“San Francisco”, “Sep. 30, 2002”) may specify a location in a two-dimensional data space, where the dimensions are LOCATION and TIME. Coordinates need not be single-grained entities. That is, coordinates of a single dimension may exist at, or be specified with respect to, various possible grains (levels of detail). In one example, a coordinate of a LOCATION dimension comprises the following grains: CONTINENT, COUNTRY and CITY.
  • The order of the grains may have some hierarchical significance. The grains are generally ordered such that finer grains are hierarchically “nested” inside coarser grains. Using the LOCATION dimension example, the CITY grain may be finer than the COUNTRY grain, and the COUNTRY grain may be finer than the CONTINENT grain. Where the order of the grains of a dimension has hierarchical significance, the value of a coordinate of that dimension, at a particular finer grain, is nominally such that the value of the coordinate of that dimension has only one value at any coarser grain for that dimension. In an example, a value of a coordinate of a LOCATION dimension may be specified at the CITY grain of the LOCATION dimension by the value “Los Angeles.” This same coordinate has only one value at the COUNTRY and CONTINENT grains: “United States” and “North America,” respectively.
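  • As a minimal sketch of the nesting of grains described above, a multi-grained LOCATION coordinate may be represented as a tuple with one value per grain. The names `LocationCoordinate`, `value_at` and `is_finer` are illustrative assumptions and do not appear in this description.

```python
from typing import NamedTuple

# Grains of the LOCATION dimension, ordered from coarsest to finest.
LOCATION_GRAINS = ("CONTINENT", "COUNTRY", "CITY")


class LocationCoordinate(NamedTuple):
    """Hypothetical multi-grained coordinate: one value per grain."""
    continent: str
    country: str
    city: str

    def value_at(self, grain: str) -> str:
        """Return this coordinate's value at the named grain."""
        return self[LOCATION_GRAINS.index(grain)]


def is_finer(grain_a: str, grain_b: str) -> bool:
    """True when grain_a is nested inside (is finer than) grain_b."""
    return LOCATION_GRAINS.index(grain_a) > LOCATION_GRAINS.index(grain_b)


los_angeles = LocationCoordinate("North America", "United States", "Los Angeles")
print(los_angeles.value_at("COUNTRY"))  # United States
print(is_finer("CITY", "COUNTRY"))      # True
```

    In this nominal case, a coordinate specified at the CITY grain has exactly one value at each coarser grain.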
  • SUMMARY
  • Processing a report query to a dimensional data model includes processing a plurality of dimension coordinates that exist within the dimensional data model. Each of the plurality of dimension coordinates has a second particular grain (“finer grain”) that is finer than a first particular grain (“coarser grain”). The report query specifies a dimension coordinate constraint to which the plurality of dimension coordinates correspond.
  • A subset of the plurality of dimension coordinates are dimension coordinates for which there is ambiguity as to what coarser grain value to associate with the finer grain value. That is, for the subset of the plurality of dimension coordinates, each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain (“finer grain value”) that is the same as the finer grain value of that dimension coordinate, and the at least one other dimension coordinate also has a value at the coarser grain (“coarser grain value”) that is different from the coarser grain value of that dimension coordinate.
  • For every unique finer grain value of the dimension coordinates of the subset, it is determined what coarser grain value to associate with all dimension coordinates of the subset having that finer grain value. The determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value. For each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate.
  • A report is generated in view of the plurality of dimension coordinates and their associated coarser grain values.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 graphically illustrates a simple situation in which there is only one finer grain value for which there is an ambiguity and, further, the ambiguity is between only two possible coarser grain values.
  • FIG. 2 graphically illustrates an example in which, similar to the FIG. 1 example, disambiguation separately occurs with respect to disambiguation time chambers for each time reporting label of the time reporting range of the report query.
  • FIG. 3 graphically illustrates an example in which a disambiguation occurs for a disambiguation time chamber that spans more than one time reporting label.
  • FIG. 4 is a block diagram illustrating an example architecture of a system in which reporting of facts of a dimensionally-modeled fact collection may be performed, including disambiguating as desired or as otherwise determined to be appropriate.
  • FIG. 5 is a flowchart illustrating an example of multiple-pass processing including disambiguation.
  • DETAILED DESCRIPTION
  • The inventors have realized that it is desirable to consider the phenomenon in which, for a subset of a plurality of dimension coordinates that satisfy a report query, there are dimension coordinates of the subset that have the same grain value at a finer grain but a different grain value at a coarser grain. In this case, when performing operations with respect to dimension coordinates of this subset, there is ambiguity as to what coarser grain value to associate with the finer grain value.
  • This phenomenon may arise, for example, when one or more dimensions in which the dimension coordinates exist is a slowly changing dimension. This is a phenomenon in which the relationship of grains for a dimension may change over time. While it may be contrived to consider the concept of slowly changing dimensions with reference to the example LOCATION dimension (since, generally, the relationship of CONTINENT, COUNTRY and CITY grains will not change over time), there are other more realistic examples of this phenomenon.
  • As one illustration, consider an EMPLOYEE dimension that is intended to represent an organizational chart of a company. In this example, the EMPLOYEE dimension comprises the following grains: ORGANIZATION, DIVISION, TEAM and PERSON. Using this example, it can be seen that values of coordinates at various grains may change as a person moves from one team to another team (or, perhaps, a team moves from one division to another division). For example, at the beginning of one quarter, Bill worked on the Red Team; sometime during the quarter, Bill moved to the Blue Team. This may be modeled by one EMPLOYEE dimension coordinate having the value “Bill” at grain PERSON and the value “Red Team” at grain TEAM, plus a second EMPLOYEE dimension coordinate also having the value “Bill” at grain PERSON but the value “Blue Team” at grain TEAM. It is also possible to encode in the representation of the dimension coordinates the specific time intervals during which these grain relationships obtained.
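  • One way to picture the two EMPLOYEE dimension coordinates for Bill, with the time intervals during which each grain relationship held, is sketched below. The class name, field names and the specific move date are assumptions made for illustration; the description states only that Bill moved sometime during the quarter.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class EmployeeCoordinate:
    """Hypothetical EMPLOYEE coordinate at the PERSON and TEAM grains."""
    person: str
    team: str
    valid_from: date  # inclusive
    valid_to: date    # inclusive


# Assumed move date (15 Nov 2005); not given in the description.
bill_red = EmployeeCoordinate("Bill", "Red Team", date(2005, 10, 1), date(2005, 11, 14))
bill_blue = EmployeeCoordinate("Bill", "Blue Team", date(2005, 11, 15), date(2005, 12, 31))
coordinates = [bill_red, bill_blue]


def team_on(coords, person, day):
    """Teams the person belonged to on a single day."""
    return [c.team for c in coords
            if c.person == person and c.valid_from <= day <= c.valid_to]


def teams_during(coords, person, start, end):
    """Every team the person belonged to at any time within [start, end]."""
    return sorted({c.team for c in coords
                   if c.person == person and c.valid_from <= end and c.valid_to >= start})


print(team_on(coordinates, "Bill", date(2005, 10, 5)))
print(teams_during(coordinates, "Bill", date(2005, 10, 1), date(2005, 12, 31)))
```

    On any single day the TEAM value for Bill is unique, but over the whole quarter two TEAM values apply, which is the source of the ambiguity.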
  • As a simplistic example of an operation to be performed with respect to dimension coordinates satisfying a dimension coordinate constraint, it may be desired to compute the average number of cookies eaten by each team's members during Q4 2005. This computation considers multiple dimensional grains. That is, the statistical population is defined at the PERSON grain (cookies eaten by members), while the reported result is at the TEAM grain (i.e., the results are reported on a per team basis) for the time period corresponding to the Q4 2005 time reporting label (shorthand—“Q4 2005 time period”).
  • Consider the following dimension coordinates, and metric values, characterized by a time period corresponding to the Q4 2005 time period:
  • TABLE 1
    Person Dimension Coordinate | Metric Value (# cookies) | Time Reporting Label
    Mary: Red Team              | 100                      | Q4-2005
    Bill: Red Team              | 60                       | Q4-2005
    Bill: Blue Team             | 60                       | Q4-2005
    Saul: Blue Team             | 90                       | Q4-2005

    The cookie eating metric values could be left attached to both the PERSONs and TEAMs to which they accrued, and an average could be computed as:

  • Red Team=(100+60)/2=80

  • Blue Team=(60+90)/2=75   (Result 1-1)
  • This preserves an ambiguity about Bill's team membership during the Q4 2005 time period and artificially deflates the per PERSON average of both teams, since Bill is counted twice.
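  • The computation of Result 1-1 can be sketched as follows, using the Table 1 values. The function name `average_per_team` is an assumption for illustration. Each team's total is divided by the number of distinct PERSON values appearing on that team's rows, so Bill is counted once on each team.

```python
from collections import defaultdict

# Rows of Table 1: (person, team, cookies), all for the Q4-2005 label.
TABLE_1 = [
    ("Mary", "Red Team", 100),
    ("Bill", "Red Team", 60),
    ("Bill", "Blue Team", 60),
    ("Saul", "Blue Team", 90),
]


def average_per_team(rows):
    """Average cookies per distinct person, grouped by the team on each row."""
    totals = defaultdict(int)
    members = defaultdict(set)
    for person, team, cookies in rows:
        totals[team] += cookies
        members[team].add(person)
    return {team: totals[team] / len(members[team]) for team in totals}


result_1_1 = average_per_team(TABLE_1)
print(result_1_1)  # {'Red Team': 80.0, 'Blue Team': 75.0}
```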
  • On the other hand, the ambiguity about Bill's team membership during the Q4 2005 time period can be arbitrarily disambiguated. For example, all of Bill's cookie eating metric values for the Q4 2005 time period could be attributed to the Red Team, even metric values for cookies eaten by Bill while Bill was on the Blue Team:

  • Red Team=(100+(60+60))/2=110

  • Blue Team=(90)/1=90   (Result 1-2)
  • Or, all of Bill's cookie eating metric values could be attributed to the Blue Team for the Q4 2005 time period, even metric values for cookies eaten by Bill while Bill was on the Red Team:

  • Red Team=(100)/1=100

  • Blue Team=((60+60)+90)/2=105   (Result 1-3)
  • In accordance with an aspect of the invention, then, and referring to the specific example of Bill and the Red Team and Blue Team, a determination is made whether those dimension coordinates corresponding to the Q4 2005 time reporting label and having a value of Bill at the PERSON grain are treated as having a value of Red Team or of Blue Team at the TEAM grain. Thus, for example, if it is determined that dimension coordinates having a value of Bill at the PERSON grain are to be treated as having a value of Red Team at the TEAM grain, then even the dimension coordinate having a value of Bill at the PERSON grain and having a value of Blue Team at the TEAM grain will be treated as having a value of Red Team at the TEAM grain.
  • More generally, there may be a subset of a plurality of dimension coordinates satisfying a dimension coordinate constraint of a report query, where each dimension coordinate of the subset is such that there is at least one other dimension coordinate of the subset having a finer grain value that is the same as the finer grain value of that dimension coordinate (e.g., Bill at the PERSON grain) and the at least one other dimension coordinate also has a coarser grain value that is different from the coarser grain value of that dimension coordinate (e.g., another dimension coordinate has a value of Red Team at the TEAM grain and that dimension coordinate has a value of Blue Team at the TEAM grain). In accordance with the aspect, for every unique finer grain value of the dimension coordinates of the subset (e.g., Bill is a unique grain value at the PERSON grain), the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value is considered to be the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value (e.g., the coarser grain value to associate with the finer grain value of Bill is considered to be either Red Team or Blue Team).
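  • A minimal sketch of this aspect, applied to the Table 1 values, is given below. The function names and the `choose` callback are assumptions for illustration. The sketch first identifies the finer grain values having more than one coarser grain value, then re-associates every coordinate of the subset with the single coarser grain value selected.

```python
from collections import defaultdict

# Rows of Table 1: (person, team, cookies), all for the Q4-2005 label.
TABLE_1 = [
    ("Mary", "Red Team", 100),
    ("Bill", "Red Team", 60),
    ("Bill", "Blue Team", 60),
    ("Saul", "Blue Team", 90),
]


def ambiguous_finer_values(rows):
    """Map each PERSON value having more than one TEAM value to its candidate teams."""
    teams = defaultdict(list)
    for person, team, _ in rows:
        if team not in teams[person]:
            teams[person].append(team)
    return {p: t for p, t in teams.items() if len(t) > 1}


def disambiguate(rows, choose):
    """Associate every ambiguous row with the one team returned by choose()."""
    chosen = {p: choose(p, cands) for p, cands in ambiguous_finer_values(rows).items()}
    # Rows outside the subset keep their own team value.
    return [(p, chosen.get(p, t), c) for p, t, c in rows]


def average_per_team(rows):
    """Average cookies per distinct person, grouped by team."""
    totals, members = defaultdict(int), defaultdict(set)
    for person, team, cookies in rows:
        totals[team] += cookies
        members[team].add(person)
    return {team: totals[team] / len(members[team]) for team in totals}


to_red = disambiguate(TABLE_1, lambda person, candidates: "Red Team")
to_blue = disambiguate(TABLE_1, lambda person, candidates: "Blue Team")
print(average_per_team(to_red))   # {'Red Team': 110.0, 'Blue Team': 90.0}
print(average_per_team(to_blue))  # {'Red Team': 100.0, 'Blue Team': 105.0}
```

    The two printed results correspond to Result 1-2 and Result 1-3, respectively.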
  • FIG. 1 illustrates this aspect graphically. With respect to FIG. 1, the PERSON grain is the finer grain and the TEAM grain is the coarser grain. The dimension coordinate 102 and the dimension coordinate 104 are considered to be dimension coordinates of a “subset.” (As mentioned above, each dimension coordinate of a subset is such that there is at least one other dimension coordinate of the subset having a finer grain value that is the same as the finer grain value of that dimension coordinate and the at least one other dimension coordinate also has a coarser grain value that is different from the coarser grain value of that dimension coordinate.) More particularly, the dimension coordinate 102 and the dimension coordinate 104 each have the value Bill at the PERSON grain (finer grain), but the dimension coordinate 102 and the dimension coordinate 104 have different values at the TEAM grain. That is, the dimension coordinate 102 has the value Blue Team at the TEAM grain, and the dimension coordinate 104 has the value Red Team at the TEAM grain.
  • Some mechanism has been used to determine and process the time period by which the dimension coordinates 102 and 104 are characterized and, thus, to associate each of the dimension coordinates 102 and 104 (and, perhaps, one or more dimension coordinates that are not shown, for which there is no ambiguity as to what coarser grain value to associate with the finer grain values) with a particular time reporting label. In the FIG. 1 example, the (one and only) particular time reporting label is Q4 2005.
  • There are various mechanisms by which dimension coordinates may be associated with time reporting labels. One example is described in pending U.S. patent application Ser. No. 11/427,718, entitled “Temporal Extent Considerations in Reporting on Facts Organized as a Dimensionally-Modeled Fact Collection,” filed on Jun. 29, 2006 and incorporated by reference herein in its entirety for all purposes. For example, in the U.S. patent application Ser. No. 11/427,718, the following description is provided:
      • In one example, the multidimensional fact collection includes metadata that provides information from which the temporal characteristics of the grain relationships can be discerned. (See, for example, the article entitled “Kimball Design Tip #8: Perfectly Partitioning History With The Type 2 Slowly Changing Dimension,” available at http://www.kimballgroup.com/html/designtipsPDF/DesignTips2000%20/Kimball DT8Perfectly.pdf, which describes augmenting dimension records with “time stamps” to temporally characterize the dimension records.)
        For purposes of the present discussion, however, it should just be considered that a particular association of dimension coordinates to time reporting label(s) has been or can be somehow determined.
  • Referring still to FIG. 1, block 106 represents the coarser grain value of Blue Team, whereas block 108 represents the coarser grain value of Red Team. It can be seen that Blue Team and Red Team are each a possible coarser grain value to associate with the finer grain value of Bill, which is the finer grain value at the PERSON grain of both the dimension coordinate 102 and the dimension coordinate 104. In the FIG. 1 diagram, the “switch” 110 graphically represents a result of a disambiguation determination 112 as to which of the Blue Team value and the Red Team value is to be associated with the finer grain value of Bill, at the PERSON grain.
  • For example, if the result of the disambiguation determination 112 is that the Blue Team value is to be associated with the value Bill at the PERSON grain, then the switch 110 is figuratively positioned such that the Blue Team value 106 is associated with the value of Bill at the PERSON grain for the dimension coordinate 102 and the dimension coordinate 104, even though the dimension coordinate 104 has an actual value of Red Team at the TEAM grain. Referring to the examples above—computing the average number of cookies eaten by each team's members for the Q4 2005 time period—this would result in processing the dimension coordinates as set forth with respect to Result 1-3 above.
  • On the other hand, if the result of the disambiguation determination 112 is that the Red Team value is to be associated with the value Bill at the PERSON grain, then the switch 110 is figuratively positioned such that the Red Team value 108 is associated with the value of Bill at the PERSON grain for both the dimension coordinate 102 and the dimension coordinate 104, even though the dimension coordinate 102 has an actual value of Blue Team at the TEAM grain. Again referring to the examples above (computing the average number of cookies eaten by each team's members for the Q4 2005 time period), this would result in processing the dimension coordinates as set forth with respect to Result 1-2 above.
  • It is noted that FIG. 1 represents a simple situation in which, for a particular subset of dimension coordinates, there is only one finer grain value for which there is an ambiguity as to an associated coarser grain value and, further, the ambiguity is between only two possible coarser grain values. By extension, there may be situations in which there is more than one finer grain value for which there is an ambiguity. In general, for example, these situations may be handled by separately disambiguating for each finer grain value for which there is an ambiguity. Furthermore, an ambiguity may be between more than two possible coarser grain values. Where an ambiguity is between more than two possible coarser grain values, the disambiguation results in a single one of the possible coarser grain values being associated with a particular finer grain value.
  • As mentioned several times above, we may collectively denote the dimension coordinates having finer grain values for which there is an ambiguity as a “subset” of dimension coordinates. Furthermore, FIG. 1, along with Table 1 and Results 1-2 and 1-3, illustrates an example where not only the dimension coordinates of the subset, but also the dimension coordinates of the larger group to which the subset belongs, have all been determined to be associated with a single particular time period. In particular, the example is one in which each of the dimension coordinates considered for disambiguation corresponds to the time period of the single Q4 2005 time reporting label.
  • Turning now to FIG. 2, unlike the FIG. 1 example, FIG. 2 exhibits an example for which there is more than a single time reporting label. That is, with respect to FIG. 2, a report is on the average number of cookies eaten by each team's members during Q4 2005, the time reporting range, reported on a monthly basis. The number of time reporting labels is three—OCT-2005, NOV-2005 and DEC-2005.
  • According to this example, each time reporting label corresponds to a separate non-overlapping time period, namely, the time periods associated with the OCT-2005, NOV-2005 and DEC-2005 time periods. In addition, each dimension coordinate satisfying the dimension coordinate constraint of a report query is associated with one of the separate non-overlapping time periods which we call “disambiguation time chambers.” Each disambiguation time chamber corresponds to a different non-overlapping time period of the time reporting range, and the subsets for which there is disambiguation exist on a disambiguation time chamber by disambiguation time chamber basis, based on a correspondence between a time period with which a dimension coordinate is associated and a time period associated with a disambiguation time chamber.
  • FIG. 2 illustrates a simple example, in which the disambiguation time chambers are at the same resolution as the time reporting labels and, thus, the disambiguation time chambers coincide with the time reporting labels. Since the disambiguation time chambers coincide with time reporting labels in the FIG. 2 example, not only do the subsets exist on a disambiguation time chamber by disambiguation time chamber basis, it also follows that the subsets exist on a time reporting label by time reporting label basis. In the FIG. 2 example, there may be a subset for which there is disambiguation for each of the OCT-2005, NOV-2005 and DEC-2005 time periods. By contrast, we explain later with respect to the FIG. 3 example how the disambiguation time chambers may be at a coarser resolution than the time reporting labels and, thus, each disambiguation time chamber may simultaneously correspond to two or more time reporting labels. (We also note that a particular dimension coordinate may be associated with more than one of the time periods with which time reporting labels are associated. We will note an example of this with reference to Table 2, later in this description.)
  • Perhaps an easier way to consider this concept is that a time period to which each separate set of dimension coordinates corresponds is defined by a time period to which one or more of the time reporting labels corresponds. For shorthand, we refer to the time period to which one of the separate sets of dimension coordinates corresponds as a “disambiguation time chamber.” In the FIG. 1 example, there is one disambiguation time chamber, and it corresponds to the Q4 2005 time period. In the FIG. 2 example, there are three disambiguation time chambers, and the three disambiguation time chambers correspond to the OCT-2005, NOV-2005, and DEC-2005, time periods, respectively. Later, we will see that not only may a disambiguation time chamber be defined by the time period to which one of the time reporting labels corresponds but, also, a disambiguation time chamber may be defined by the time period to which more than one of the time reporting labels collectively correspond (or, put another way, a disambiguation time chamber may correspond to one or more time reporting labels).
  • Before leaving FIG. 2, we again mention that, as discussed above relative to FIG. 1, for each subset of dimension coordinates considered for disambiguation, there may be one or more finer grain values for which there is an ambiguity as to what is the associated coarser grain value. For example, maybe there is only an ambiguity as to the coarser grain value associated with “Bill” or maybe there is an ambiguity as to the coarser grain value associated with “Bill” and there is also an ambiguity as to the coarser grain value associated with “Steve.” Furthermore, for a particular one of those finer grain values, the disambiguation may be among two or more coarser grain values (e.g., the disambiguation may be among Red Team and Blue Team, or the disambiguation may be among Red Team, Blue Team and Green Team).
  • We now discuss an example in which a situation like the FIG. 2 situation may apply. That is, we discuss an example in which there are multiple disambiguation time chambers, the disambiguation time chambers being at the same resolution as the time reporting labels such that each disambiguation time chamber corresponds to one separate time reporting label. Consider the following dimension coordinates, metric values and time reporting labels:
  • TABLE 2
    Person Dimension Coordinate | Metric Value (# cookies) | Time Reporting Label
    Mary: Red Team              | 25                       | OCTOBER 2005
    Mary: Red Team              | 35                       | NOVEMBER 2005
    Mary: Red Team              | 40                       | DECEMBER 2005
    Bill: Red Team              | 40                       | OCTOBER 2005
    Bill: Red Team              | 20                       | NOVEMBER 2005
    Bill: Blue Team             | 20                       | NOVEMBER 2005
    Bill: Blue Team             | 40                       | DECEMBER 2005
    Saul: Blue Team             | 30                       | OCTOBER 2005
    Saul: Blue Team             | 30                       | NOVEMBER 2005
    Saul: Blue Team             | 30                       | DECEMBER 2005

    (Above, it was mentioned that an example would be discussed, with reference to Table 2, of a particular dimension coordinate being associated with more than one of the time periods to which time reporting labels correspond. In Table 2, an example of such a dimension coordinate includes the dimension coordinate having the value Mary at the PERSON grain and having the value Red Team at the TEAM grain. This dimension coordinate is associated with all of the following time reporting labels: OCT 2005, NOV 2005 and DEC 2005.)
  • With respect to the Table 2 dimension coordinates, metric values and time reporting labels, the cookie eating metric values could be left “attached” to both the PERSONs and TEAMs to which they accrued (i.e., no disambiguation), and an average per team, per each time reporting label, could be computed as:
  • (Result 2-1)
    Month         | Red Team           | Blue Team
    OCTOBER 2005  | (25 + 40)/2 = 32.5 | (30)/1 = 30
    NOVEMBER 2005 | (35 + 20)/2 = 27.5 | (20 + 30)/2 = 25
    DECEMBER 2005 | (40)/1 = 40        | (40 + 30)/2 = 35

    As with Result 1-1 above, it is noted how the per-PERSON average is artificially depressed for both TEAMs for the time reporting label NOV-2005, corresponding to the month Bill changed teams.
  • Alternatively, for a disambiguation time chamber defined by the time period to which the NOV-2005 time reporting label corresponds, the TEAM value of Red Team could be attributed to Bill for the NOV-2005 time reporting label. (It is noted that, with respect to the dimension coordinates in Table 2, dimension coordinates associated with the NOV-2005 disambiguation time chamber are the only dimension coordinates for which an ambiguity exists as to coarser grain values associated with particular finer grain values.)
  • (Result 2-2)
    Month         | Red Team                  | Blue Team
    OCTOBER 2005  | (25 + 40)/2 = 32.5        | (30)/1 = 30
    NOVEMBER 2005 | (35 + (20 + 20))/2 = 37.5 | (30)/1 = 30
    DECEMBER 2005 | (40)/1 = 40               | (40 + 30)/2 = 35

    While the Table 2 dimension coordinates are such that disambiguation is not appropriate for dimension coordinates other than a subset of dimension coordinates characterized by a time to which the NOV-2005 time reporting label corresponds (i.e., associated with the disambiguation time chamber defined by the NOV-2005 time period), for other dimension coordinates, it may be appropriate for there to be disambiguation for dimension coordinates of a subset of dimension coordinates associated with the disambiguation time chamber defined by the OCT-2005 time period and/or for the dimension coordinates of a subset of dimension coordinates associated with the disambiguation time chamber defined by the DEC-2005 time period (invoking determination 202 and/or determination 206).
  • As another alternative with respect to the Table 2 data, the disambiguation for the dimension coordinates associated with the disambiguation time chamber defined by the NOV-2005 time period could result in the TEAM value of Blue Team being attributed to Bill for the NOV-2005 time reporting label.
  • (Result 2-3)
    Month         | Red Team           | Blue Team
    OCTOBER 2005  | (25 + 40)/2 = 32.5 | (30)/1 = 30
    NOVEMBER 2005 | (35)/1 = 35        | (30 + (20 + 20))/2 = 35
    DECEMBER 2005 | (40)/1 = 40        | (40 + 30)/2 = 35

    It can thus be seen that, in general, a disambiguation may occur separately for any or all disambiguation time chambers (which correspond to time reporting labels by being defined by time periods to which the time reporting labels correspond) for which there is reporting based on the report query.
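  • The per-chamber disambiguation of the Table 2 values can be sketched as follows, with each disambiguation time chamber coinciding with one monthly time reporting label. The function names and the `choose` callback are assumptions for illustration.

```python
from collections import defaultdict

# Rows of Table 2: (person, team, cookies, time reporting label).
TABLE_2 = [
    ("Mary", "Red Team", 25, "OCTOBER 2005"),
    ("Mary", "Red Team", 35, "NOVEMBER 2005"),
    ("Mary", "Red Team", 40, "DECEMBER 2005"),
    ("Bill", "Red Team", 40, "OCTOBER 2005"),
    ("Bill", "Red Team", 20, "NOVEMBER 2005"),
    ("Bill", "Blue Team", 20, "NOVEMBER 2005"),
    ("Bill", "Blue Team", 40, "DECEMBER 2005"),
    ("Saul", "Blue Team", 30, "OCTOBER 2005"),
    ("Saul", "Blue Team", 30, "NOVEMBER 2005"),
    ("Saul", "Blue Team", 30, "DECEMBER 2005"),
]


def disambiguate_by_chamber(rows, choose):
    """Resolve ambiguity separately within each chamber (here, each month label)."""
    candidates = defaultdict(list)  # (label, person) -> distinct teams seen
    for person, team, _, label in rows:
        if team not in candidates[(label, person)]:
            candidates[(label, person)].append(team)
    resolved = []
    for person, team, cookies, label in rows:
        cands = candidates[(label, person)]
        if len(cands) > 1:  # ambiguous only within this chamber
            team = choose(label, person, cands)
        resolved.append((person, team, cookies, label))
    return resolved


def average_per_team_and_label(rows):
    """Average cookies per distinct person for each (label, team) pair."""
    totals, members = defaultdict(int), defaultdict(set)
    for person, team, cookies, label in rows:
        totals[(label, team)] += cookies
        members[(label, team)].add(person)
    return {key: totals[key] / len(members[key]) for key in totals}


result_2_2 = average_per_team_and_label(
    disambiguate_by_chamber(TABLE_2, lambda label, person, cands: "Red Team"))
result_2_3 = average_per_team_and_label(
    disambiguate_by_chamber(TABLE_2, lambda label, person, cands: "Blue Team"))
print(result_2_2[("NOVEMBER 2005", "Red Team")])   # 37.5
print(result_2_3[("NOVEMBER 2005", "Blue Team")])  # 35.0
```

    Only the NOVEMBER 2005 rows for Bill are altered, since that is the only chamber in which a PERSON value has more than one TEAM value.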
  • Furthermore, unlike the FIG. 1 and FIG. 2 examples, in which the disambiguation time chambers each correspond to a separate single respective time reporting label, there may be examples in which the disambiguation time chambers each correspond to more than one time reporting label. For example, the reporting labels may be at a month resolution (e.g., JAN-2005, FEB-2005, . . . , NOV-2005 and DEC-2005) of the time dimension. The disambiguation time chambers may, on the other hand, be at a quarter resolution (e.g., Q1 2005, Q2 2005, Q3 2005 and Q4 2005). In other words, all the dimension coordinates characterized by a time period that corresponds to any time reporting label for a month in a particular quarter would be associated with that particular quarter for disambiguation purposes.
  • FIG. 3 illustrates such an example. Referring to FIG. 3, dimension coordinates are shown that are associated with time periods corresponding to the time reporting labels JAN-2005 (302), FEB-2005 (304), MAR-2005 (306), APR-2005 (312), MAY-2005 (314) and JUN-2005 (316). Dimension coordinates associated with time periods corresponding to the remaining time reporting labels for 2005 are not shown, to simplify the illustration.
  • Using the example of the month resolution and quarter resolution of the 2005 time period, the disambiguation time chambers would be defined by each quarter-year time period of the whole year, for a total of four disambiguation time chambers. Again, there are four disambiguation time chambers for the 2005 time period, even though there are twelve time reporting labels for the 2005 time period. That is, the disambiguation decision for each PERSON grain entity could thus be made up to four times, once for each quarter (e.g., determinations 308 and 318 for the first and second quarter, respectively), even though the report processing is carried out twelve times, once for each “month” time reporting label.
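  • A sketch of quarter-resolution disambiguation time chambers with month-resolution time reporting labels follows. The rows shown are hypothetical and are not taken from any table in this description, and the function names are assumptions for illustration.

```python
from collections import defaultdict

MONTHS = ["JAN", "FEB", "MAR", "APR", "MAY", "JUN",
          "JUL", "AUG", "SEP", "OCT", "NOV", "DEC"]


def chamber_for(label: str) -> str:
    """Map a month label such as 'FEB-2005' to its quarter chamber, e.g. 'Q1 2005'."""
    month, year = label.split("-")
    quarter = MONTHS.index(month) // 3 + 1
    return f"Q{quarter} {year}"


# Hypothetical rows (person, team, month label), listed in chronological order.
ROWS = [
    ("Bill", "Red Team", "JAN-2005"),
    ("Bill", "Red Team", "FEB-2005"),
    ("Bill", "Blue Team", "MAR-2005"),
    ("Bill", "Blue Team", "APR-2005"),
]


def disambiguate_by_quarter(rows, choose):
    """Make one determination per (quarter chamber, person); apply it to each month row."""
    candidates = defaultdict(list)
    for person, team, label in rows:
        key = (chamber_for(label), person)
        if team not in candidates[key]:
            candidates[key].append(team)
    resolved = {key: (choose(key[0], key[1], cands) if len(cands) > 1 else cands[0])
                for key, cands in candidates.items()}
    return [(person, resolved[(chamber_for(label), person)], label)
            for person, _, label in rows]


# Pick the last-listed candidate, which is the latest team only because
# the hypothetical rows above are in chronological order.
latest = disambiguate_by_quarter(ROWS, lambda chamber, person, cands: cands[-1])
for row in latest:
    print(row)
```

    In this sketch no single month is ambiguous for Bill, yet the Q1 2005 chamber is, so the JAN-2005 and FEB-2005 rows are reported under the team determined for the whole quarter.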
  • In the discussion thus far, we have not described what criterion may be used to make particular disambiguation determinations (such as, for example, the determination 112 in FIG. 1; the determinations 202, 204 and 206 in FIG. 2; and the determinations 308 and 318 in FIG. 3).
  • In one example, relative to a disambiguation time chamber that corresponds to November 2005, it is supposed that Alice worked for Red Team from before November 2005 and up until 6 Nov. 2005. Alice worked for Green Team from 7 Nov. 2005 to 21 Nov. 2005. Finally, Alice worked for Blue Team from 22 Nov. 2005 until well after November 2005. Furthermore, Alice eats one cookie every day she works, and she works every day in November.
  • Further, suppose each team was allotted a bonus budget. Red Team's bonus budget is $3k, Green Team's bonus budget is $5k and Blue Team's bonus budget is $7k. In disambiguating Alice's team for the November 2005 disambiguation time chamber, various criteria may be considered, with varying results as to the coarser grain value (value at TEAM grain) to associate with the finer grain value of Alice (value at PERSON grain).
  • For example, if the criterion is “latest team,” then the result is Blue Team. If the criterion is “earliest team,” then the result is Red Team. It is noted that the “latest team” and “earliest team” criteria are time based. Other criteria may include, for example, “longest team membership during time of disambiguation time chamber.” For this criterion, the result is Green Team (15 days in November, versus 6 days for Red Team and 9 days for Blue Team). For the criterion of “highest bonus budget,” the result is Blue Team. For the criterion of “team on which she ate the most cookies,” the result is Green Team.
  • It can be seen, then, that many different criteria may be used.
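  • The criteria of the Alice example may be sketched as interchangeable selection functions, as below. The membership intervals are clipped to the November 2005 disambiguation time chamber, and the names `MEMBERSHIPS`, `CRITERIA` and `days` are assumptions for illustration.

```python
from datetime import date

# Alice's team memberships within November 2005: (team, first day, last day).
MEMBERSHIPS = [
    ("Red Team", date(2005, 11, 1), date(2005, 11, 6)),
    ("Green Team", date(2005, 11, 7), date(2005, 11, 21)),
    ("Blue Team", date(2005, 11, 22), date(2005, 11, 30)),
]
BONUS_BUDGET = {"Red Team": 3000, "Green Team": 5000, "Blue Team": 7000}
COOKIES_PER_DAY = 1  # Alice eats one cookie every day she works


def days(start, end):
    """Number of days in the inclusive interval [start, end]."""
    return (end - start).days + 1


# Each criterion takes the membership list and returns one team name.
CRITERIA = {
    "latest team": lambda m: max(m, key=lambda r: r[2])[0],
    "earliest team": lambda m: min(m, key=lambda r: r[1])[0],
    "longest team membership": lambda m: max(m, key=lambda r: days(r[1], r[2]))[0],
    "highest bonus budget": lambda m: max(m, key=lambda r: BONUS_BUDGET[r[0]])[0],
    "most cookies": lambda m: max(m, key=lambda r: COOKIES_PER_DAY * days(r[1], r[2]))[0],
}

results = {name: pick(MEMBERSHIPS) for name, pick in CRITERIA.items()}
for name, team in results.items():
    print(f"{name}: {team}")
```

    Because each criterion has the same shape (candidates in, one coarser grain value out), a different criterion could be supplied for each report or each disambiguation time chamber.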
  • We now discuss some particulars of the processing that may be done in view of the association of coarser grain values with finer grain values, where such an association may be a result of a disambiguation. In particular, processing for generating the report may include associating metric values in a “copying down” or “rolling up” direction. “Rolling up” includes associating, with a coarser grain value, a metric value that corresponds to a finer grain value (i.e., rolling up from finer to coarser) with which that coarser grain value is associated. Determining, for each month, the average number of cookies per person (metric value that corresponds to a finer grain value, at PERSON grain) eaten by each of Red Team and Blue Team (coarser grain value, at TEAM grain) is an example of rolling up. In other words, “rolling up” includes associating in a many (finer grain values) to one (coarser grain value) manner.
  • “Copying down” includes associating, with finer grain values, a metric value that corresponds to a coarser grain value (i.e., copying down from coarser to finer). An example of copying down includes, for each month, associating the team goal (metric value that corresponds to a coarser grain value, at TEAM grain) with every person (finer grain value, at PERSON grain) on that team. In other words, “copying down” includes associating in a one (coarser grain value) to many (finer grain values) manner.
  • In either case (copying down or rolling up), the disambiguation is useful for resolving what coarser grain value is to be associated with finer grain values, for which the association may otherwise be ambiguous.
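  • Rolling up and copying down can be sketched as two small functions operating on a PERSON-to-TEAM association that has already been disambiguated. The cookie values are the NOVEMBER 2005 values of Table 2 with Bill attributed to Red Team; the team goal values and all names are hypothetical.

```python
from collections import defaultdict

# PERSON -> TEAM association after disambiguation (Bill attributed to Red Team).
TEAM_OF = {"Mary": "Red Team", "Bill": "Red Team", "Saul": "Blue Team"}
COOKIES = {"Mary": 35, "Bill": 40, "Saul": 30}   # metric at the PERSON grain
TEAM_GOAL = {"Red Team": 80, "Blue Team": 40}    # hypothetical metric at the TEAM grain


def roll_up(person_metric, team_of):
    """Many-to-one: average a PERSON-grain metric for each TEAM."""
    values = defaultdict(list)
    for person, value in person_metric.items():
        values[team_of[person]].append(value)
    return {team: sum(v) / len(v) for team, v in values.items()}


def copy_down(team_metric, team_of):
    """One-to-many: attach each TEAM-grain metric to every PERSON on that team."""
    return {person: team_metric[team] for person, team in team_of.items()}


print(roll_up(COOKIES, TEAM_OF))      # {'Red Team': 37.5, 'Blue Team': 30.0}
print(copy_down(TEAM_GOAL, TEAM_OF))  # {'Mary': 80, 'Bill': 80, 'Saul': 40}
```

    Both functions rely on `TEAM_OF` giving exactly one TEAM value per PERSON value, which is what the disambiguation provides.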
  • FIG. 4 is a block diagram illustrating an example architecture of a system 400 in which reporting of facts of a dimensionally-modeled fact collection may be performed, including disambiguating as desired or as otherwise determined to be appropriate. Referring to FIG. 4, a user 402 may cause a report query 404 to be provided to a fact collection query generator 406. For example, the user 402 may interact with a web page via a web browser, where the web page is served by a report user interface using, for example, a Java Server Page mechanism. In this example, the user 402 interacts with the web page such that the report query 404 is provided to the fact collection query generator 406. The report query 404 includes a dimension coordinate constraint, which may be one or more dimension coordinate constraints.
  • In general, a dimension coordinate constraint for a dimension of the dimensionally-modeled fact collection specifies coordinates of that dimension of the dimensionally-modeled fact collection. For example, a dimension coordinate constraint may specify coordinates of that dimension of the dimensionally-modeled fact collection by specifying a value of the dimension at a particular grain. Dimension coordinate constraints of the report query 404, then, specify a plurality of coordinates of one or more dimensions of the dimensionally-modeled fact collection, on which a report is to be based. It is noted that the lack of an explicit constraint may imply a “null constraint” (which, in and of itself, may be considered a dimension coordinate constraint) for which the resulting plurality of dimension coordinates on which the report is to be based, is all dimension coordinates from that dimension.
  • The fact collection query generator 406 processes the report query 404 to generate an appropriate corresponding fact collection query 408, which is presented to the dimensionally-modeled fact collection 410. A result 416 of presenting the fact collection query 408 to the dimensionally-modeled fact collection 410 is processed by a report generator 418 to generate a report corresponding to the report query 404 caused to be provided by the user 402. In particular, the generated report includes an indication of processing with respect to dimensional members as appropriate in view of the dimension coordinate constraints of the report query 404.
  • In one example, the dimensionally-modeled fact collection 410 is implemented as a relational database, storing fact data in a manner that is accessible to users according to a ROLAP (Relational Online Analytical Processing) schema of fact and dimension tables. In this case, the fact collection query 408 may originate as a database query in one form that is processed into another form. For example, it may be processed by an OLAP query engine into a fact collection query 408 presented as an SQL query that is understandable by the underlying relational database. This is just one example, however, and there are many other ways of representing and accessing a dimensionally-modeled fact collection.
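  • The relational case can be sketched with Python's standard `sqlite3` module. The table names `dim_employee` and `fact_sales`, their columns, and the function `team_totals` are assumptions made for this illustration and are not taken from the described system; the sketch only shows a report-level constraint being turned into an SQL query over a fact table joined to a dimension table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_employee (coord_id INTEGER PRIMARY KEY, person TEXT, team TEXT);
    CREATE TABLE fact_sales (coord_id INTEGER, month TEXT, amount REAL);
    INSERT INTO dim_employee VALUES
        (1, 'Alice', 'Team A'), (2, 'Bob', 'Team A'), (3, 'Carol', 'Team B');
    INSERT INTO fact_sales VALUES
        (1, '2006-01', 10.0), (2, '2006-01', 20.0), (3, '2006-01', 5.0);
""")


def team_totals(conn, team=None):
    """Generate and run an SQL query; team=None plays the role of a null constraint."""
    sql = (
        "SELECT d.team, SUM(f.amount) FROM fact_sales f "
        "JOIN dim_employee d ON d.coord_id = f.coord_id "
    )
    params = ()
    if team is not None:
        sql += "WHERE d.team = ? "
        params = (team,)
    sql += "GROUP BY d.team ORDER BY d.team"
    return conn.execute(sql, params).fetchall()


rows = team_totals(conn)
```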
  • Processing 418 is applied to the fact collection result 416 to generate a report. The generated report includes an indication of dimension members and facts corresponding to those indicated dimension members. What facts are reported may depend, at least in part, on disambiguation of what coarser grain value is determined to correspond to particular finer grain values.
  • Referring still to FIG. 4, the composition of the generated report may be accomplished by the fact collection query generator 406 particularly generating the fact collection query 408 in accordance with the report query, by the result processing 418 particularly processing the fact collection result (e.g., by applying filtering) in accordance with the report query, or by a combination of both.
  • As also illustrated in FIG. 4, the report query 404 may include a disambiguation criterion that may be provided, for example, via a user interface. In some examples, in the absence of such a disambiguation criterion, the manner in which the facts corresponding to those indicated dimension members are reported may be according to a default mode or according to a preconfigured mode. The fact collection query generator 406 and/or the result processing 418, as appropriate, operate according to the default, preconfigured or designated mode.
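  • One way to picture the choice among a designated, preconfigured, or default mode is the small sketch below. The precedence order (designated first, then preconfigured, then default), the dictionary key `"disambiguation"`, and the mode name `"latest"` are all assumptions for illustration; the description does not prescribe them.

```python
DEFAULT_MODE = "latest"  # hypothetical built-in default mode


def resolve_mode(report_query, preconfigured=None):
    """Pick the disambiguation mode for a report query.

    Assumed precedence: a criterion designated in the query wins, then a
    preconfigured mode, then the default mode.
    """
    designated = report_query.get("disambiguation")
    if designated is not None:
        return designated
    if preconfigured is not None:
        return preconfigured
    return DEFAULT_MODE
```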
  • In accordance with one example, a multiple-pass processing is utilized. An example of the multiple-pass processing is illustrated by the flowchart of FIG. 5. In a first pass 502, every dimension coordinate (which may be one or more dimension coordinates) that satisfies the dimension coordinate constraint is determined. The determined dimension coordinates have a particular value at a particular grain. In another pass 504, it is determined which of the dimension coordinates are part of a subset of “ambiguous” dimension coordinates, meeting the following conditions:
      • Each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain (“finer grain value”) that is the same as the finer grain value of that dimension coordinate; and
      • the at least one other dimension coordinate also has a value at the coarser grain (“coarser grain value”) that is different from the coarser grain value of that dimension coordinate.
        That is, it is determined which of the dimension coordinates satisfying the dimension coordinate constraint of the report query are subject to the coarser grain value to finer grain value association ambiguity.
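  • The two conditions above can be sketched as a grouping step: a coordinate is ambiguous when its finer grain value appears with more than one distinct coarser grain value. The function name `ambiguous_subset` and the representation of a coordinate as a `(finer_value, coarser_value)` pair are assumptions for illustration.

```python
from collections import defaultdict


def ambiguous_subset(coords):
    """Return the coordinates whose finer grain value maps to more than one coarser grain value.

    coords is a list of (finer_value, coarser_value) pairs.
    """
    coarser_by_finer = defaultdict(set)
    for finer, coarser in coords:
        coarser_by_finer[finer].add(coarser)
    return [(f, c) for f, c in coords if len(coarser_by_finer[f]) > 1]


coords = [
    ("Alice", "Team A"),
    ("Alice", "Team B"),  # same person, different team: ambiguous
    ("Bob", "Team A"),
    ("Carol", "Team B"),
    ("Carol", "Team B"),  # same person, same team: not ambiguous
]
subset = ambiguous_subset(coords)
```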
  • At step 506, for every unique finer grain value of the dimension coordinates of the subset, it is determined what coarser grain value to associate with all dimension coordinates of the subset having that finer grain value. In particular, the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value. For each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate. Steps 504 and 506 may be repeated for additional disambiguation time chambers. That is, the subset would be determined for each additional disambiguation time chamber and disambiguation carried out as appropriate on that subset. Finally, at step 508, a report is generated in view of the plurality of dimension coordinates and their associated coarser grain values.
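  • A minimal sketch of the disambiguation step, repeated per disambiguation time chamber, is given below. The `Coord` fields, the chamber labels, and the `latest` criterion (pick the team of the most recently effective relationship) are hypothetical choices for illustration; any criterion that returns the coarser grain value of one of the candidate coordinates would fit the same shape.

```python
from collections import defaultdict
from typing import NamedTuple


class Coord(NamedTuple):
    person: str     # finer grain value
    team: str       # coarser grain value
    chamber: str    # label of the disambiguation time chamber (assumed precomputed)
    effective: str  # ISO date on which the person-team relationship took effect


def latest(candidates):
    """Hypothetical criterion: the team of the most recently effective coordinate."""
    return max(candidates, key=lambda c: c.effective).team


def disambiguate(coords, criterion=latest):
    """Return {(chamber, person): team}, applying the criterion only where teams conflict."""
    groups = defaultdict(list)
    for c in coords:
        groups[(c.chamber, c.person)].append(c)
    assigned = {}
    for key, members in groups.items():
        teams = {m.team for m in members}
        # Coordinates outside the ambiguous subset keep their own coarser grain value.
        assigned[key] = members[0].team if len(teams) == 1 else criterion(members)
    return assigned


coords = [
    Coord("Alice", "Team A", "2006-Q1", "2006-01-01"),
    Coord("Alice", "Team B", "2006-Q1", "2006-02-15"),
    Coord("Alice", "Team B", "2006-Q2", "2006-02-15"),
    Coord("Bob", "Team A", "2006-Q1", "2006-01-01"),
]
teams = disambiguate(coords)
```

  • Grouping by `(chamber, person)` means each chamber is disambiguated independently, so a person may be attributed to different teams in different chambers. A report generator could then roll facts up to the team recorded in `teams` for each chamber.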
  • We have thus described how a situation may be addressed in which a subset of a plurality of dimension coordinates satisfying a dimension coordinate constraint of a report query are such that, for each dimension coordinate of the subset, there is at least one other dimension coordinate of the subset having a finer grain value that is the same as the finer grain value of that dimension coordinate and the at least one other dimension coordinate also has a coarser grain value that is different from the coarser grain value of that dimension coordinate. In particular, disambiguation may be carried out such that, for every unique finer grain value of the dimension coordinates of the subset, the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value is determined, wherein the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value. A report may be generated in view of the plurality of dimension coordinates and their associated coarser grain values.

Claims (30)

1. A computer-implemented method of processing a report query to a dimensional data model by processing a plurality of dimension coordinates that exist within the dimensional data model, wherein each of the plurality of dimension coordinates has a second particular grain (“finer grain”) that is finer than a first particular grain (“coarser grain”) and having a value at the finer grain (“finer grain value”) and at the coarser grain (“coarser grain value”), the report query specifying a dimension coordinate constraint to which the plurality of dimension coordinates correspond, the computer-implemented method being carried out by at least one computing device executing instructions from a computer-readable medium, the method comprising:
for a temporal dimension having temporal characteristics of grain relationships including a subset of the plurality of dimension coordinates in which there is a time changing relationship of the grains over a time period of interest leading to a potential ambiguity as to what coarser grain value to associate with a finer grain value in attributing facts at the finer grain value to the coarser grain value,
the fine grain value corresponding to a value at a person grain and the coarse grain value corresponds to a value at a team grain, wherein the disambiguation resolves an ambiguity regarding which team to associate with at least one person in an employee dimension of an organizational chart;
wherein
each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain value that is the same as the finer grain value of that dimension coordinate; and
the at least one other dimension coordinate also has a value at the coarser grain value that is different from the coarser grain value of that dimension coordinate,
for every unique finer grain value of the dimension coordinates of the subset, by the at least one computing device, executing computer program instructions from the computer-readable medium to determine the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value, wherein the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value such that the relationship is disambiguated by applying a disambiguation criterion to determine the association between coarser grain values when the finer grain value is associated with at least two different coarser grain values during the time period of interest;
wherein, for each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate;
the method further comprising by the at least one computing device, executing computer program instructions from the computer-readable medium to generate a report in view of the plurality of dimension coordinates and their associated coarser grain values.
2. The computer-implemented method of claim 1, wherein:
the report query directly specifies the disambiguation criterion.
3. The computer-implemented method of claim 1, wherein:
the report query indirectly specifies the disambiguation criterion.
4. The computer-implemented method of claim 1, further comprising reporting at least one team metric for contributions of individual team members based on a disambiguated relationship between fine grain values at the person grain and coarse grain values at the team grain.
5. A computer-implemented method of processing a report query to a dimensional data model by processing a plurality of dimension coordinates that exist within the dimensional data model, wherein each of the dimension coordinates has a second particular grain (“finer grain”) that is finer than a first particular grain (“coarser grain”) and having a value at the finer grain (“finer grain value”) and at the coarser grain (“coarser grain value”), the report query specifying constraints including dimension coordinate constraints and a time reporting range constraint, the plurality of dimension coordinates corresponding to the constraints specified by the report query, the computer-implemented method being carried out by at least one computing device executing instructions from a computer-readable medium, the method comprising:
for each of a plurality of disambiguation time chambers, each disambiguation time chamber corresponding to a different non-overlapping time period of the time reporting range,
for a temporal dimension having temporal characteristics of grain relationships including a subset of the plurality of dimension coordinates in which there is a time changing relationship of the grains over a time period of interest leading to a potential ambiguity as to what coarser grain value to associate with a finer grain value in attributing facts at the finer grain value to the coarser grain value,
wherein
each dimension coordinate of the subset is associated with the time period to which that disambiguation time chamber corresponds;
each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain value that is the same as the finer grain value of that dimension coordinate; and
the at least one other dimension coordinate also has a value at the coarser grain value that is different from the coarser grain value of that dimension coordinate,
for every unique finer grain value of the dimension coordinates of the subset, by the at least one computing device, executing computer program instructions from the computer-readable medium to determine the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value, wherein the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value so that the relationship is disambiguated by applying a disambiguation criterion to determine the association between coarser grain values when the finer grain value is associated with at least two different coarser grain values during the time period of interest;
wherein, for each of the dimension coordinates of the plurality of dimension coordinates associated with the time period to which that disambiguation time chamber corresponds but is not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate;
the method further comprising by the at least one computing device, executing computer program instructions from the computer-readable medium to generate a report in view of the plurality of dimension coordinates and their associated coarser grain values;
wherein the fine grain value corresponding to a value at a person grain and the coarse grain value corresponds to a value at a team grain, wherein the disambiguation resolves an ambiguity regarding which team to associate with at least one person in an employee dimension of an organizational chart.
6. The computer-implemented method of claim 5, wherein:
the report query directly specifies the disambiguation criterion.
7. The computer-implemented method of claim 5, wherein:
the report query indirectly specifies the disambiguation criterion.
8. The computer-implemented method of claim 5, further comprising reporting at least one team metric for contributions of individual team members based on a disambiguated relationship between fine grain values at the person grain and coarse grain values at the team grain.
9. A computer program product having computer program instructions stored on a computer readable medium which are operable to cause at least one computing device to:
issue a report query for a dimensional data model having a plurality of dimension coordinates that exist within the dimensional data model, wherein each of the plurality of dimension coordinates has a second particular grain (“finer grain”) that is finer than a first particular grain (“coarser grain”) and having a value at the finer grain (“finer grain value”) and at the coarser grain (“coarser grain value”), the report query specifying a dimension coordinate constraint to which the plurality of dimension coordinates correspond;
receiving, in response to the report query, a report in view of the plurality of dimension coordinates and their associated coarser grain values, the report processed to:
for a temporal dimension having temporal characteristics of grain relationships including a subset of the plurality of dimension coordinates in which there is a time changing relationship of the grains over a time period of interest leading to a potential ambiguity as to what coarser grain value to associate with a finer grain value in attributing facts at the finer grain value to the coarser grain value,
wherein
each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain value that is the same as the finer grain value of that dimension coordinate; and
the at least one other dimension coordinate also has a value at the coarser grain value that is different from the coarser grain value of that dimension coordinate,
the report processed to determine the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value, wherein the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value such that the relationship is disambiguated by applying a disambiguation criterion to determine the association between coarser grain values when the finer grain value is associated with at least two different coarser grain values during the time period of interest;
wherein, for each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate.
10. The computer-program product of claim 9, wherein:
the report query directly specifies the disambiguation criterion.
11. The computer-program product of claim 9, wherein:
the report query indirectly specifies the disambiguation criterion.
12. The computer-program product of claim 9, wherein the fine grain value corresponding to a value at a person grain and the coarse grain value corresponds to a value at a team grain, wherein the disambiguation resolves an ambiguity regarding which team to associate with at least one person in an employee dimension of an organizational chart.
13. The computer-program product of claim 12, further comprising reporting at least one team metric for contributions of individual team members based on a disambiguated relationship between fine grain values at the person grain and coarse grain values at the team grain.
14. A system, comprising:
at least one computing device having computer program instructions stored on a computer readable medium which are operable to cause the at least one computing device to:
process a report query for a dimensional data model having a plurality of dimension coordinates that exist within the dimensional data model, wherein each of the plurality of dimension coordinates has a second particular grain (“finer grain”) that is finer than a first particular grain (“coarser grain”) and having a value at the finer grain (“finer grain value”) and at the coarser grain (“coarser grain value”), the report query specifying a dimension coordinate constraint to which the plurality of dimension coordinates correspond;
generate, in response to the report query, a report in view of the plurality of dimension coordinates and their associated coarser grain values, the report being processed to:
for a temporal dimension having temporal characteristics of grain relationships including a subset of the plurality of dimension coordinates in which there is a time changing relationship of the grains over a time period of interest leading to a potential ambiguity as to what coarser grain value to associate with a finer grain value in attributing facts at the finer grain value to the coarser grain value,
wherein
each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain value that is the same as the finer grain value of that dimension coordinate; and
the at least one other dimension coordinate also has a value at the coarser grain value that is different from the coarser grain value of that dimension coordinate,
the report processed to determine the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value, wherein the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value such that the relationship is disambiguated by applying a disambiguation criterion to determine the association between coarser grain values when the finer grain value is associated with at least two different coarser grain values during the time period of interest;
wherein, for each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate.
15. The system of claim 14, wherein:
the report query directly specifies the disambiguation criterion.
16. The system of claim 14, wherein:
the report query indirectly specifies the disambiguation criterion.
17. The system of claim 14 further comprising at least one relational database to store fact data.
18. The system of claim 14 further comprising a fact collection query generator to process the report query and generate a fact collection query presented to a dimensionally modeled fact collection.
19. The system of claim 18, wherein the dimensionally modeled fact collection is implemented by at least one relational database.
20. The system of claim 19, further comprising a report generator.
21. The system of claim 14 wherein the fine grain value corresponding to a value at a person grain and the coarse grain value corresponds to a value at a team grain, wherein the disambiguation resolves an ambiguity regarding which team to associate with at least one person in an employee dimension of an organizational chart.
22. The system of claim 21, further comprising reporting at least one team metric for contributions of individual team members based on a disambiguated relationship between fine grain values at the person grain and coarse grain values at the team grain.
23. A computer implemented method, comprising:
issuing a report query for a dimensional data model having a plurality of dimension coordinates that exist within the dimensional data model, wherein each of the plurality of dimension coordinates has a second particular grain (“finer grain”) that is finer than a first particular grain (“coarser grain”) and having a value at the finer grain (“finer grain value”) and at the coarser grain (“coarser grain value”), the report query specifying a dimension coordinate constraint to which the plurality of dimension coordinates correspond;
receiving, in response to the report query, a report in view of the plurality of dimension coordinates and their associated coarser grain values, the report processed to:
for a temporal dimension having temporal characteristics of grain relationships including a subset of the plurality of dimension coordinates in which there is a time changing relationship of the grains over a time period of interest leading to a potential ambiguity as to what coarser grain value to associate with a finer grain value in attributing facts at the finer grain value to the coarser grain value,
wherein
each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain value that is the same as the finer grain value of that dimension coordinate; and
the at least one other dimension coordinate also has a value at the coarser grain value that is different from the coarser grain value of that dimension coordinate,
the report processed to determine the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value, wherein the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value such that the relationship is disambiguated by applying a disambiguation criterion to determine the association between coarser grain values when the finer grain value is associated with at least two different coarser grain values during the time period of interest;
wherein, for each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate.
24. The computer-implemented method of claim 23, wherein:
the report query directly specifies the disambiguation criterion.
25. The computer-implemented method of claim 23, wherein:
the report query indirectly specifies the disambiguation criterion.
26. The computer-implemented method of claim 23, wherein the fine grain value corresponding to a value at a person grain and the coarse grain value corresponds to a value at a team grain, wherein the disambiguation resolves an ambiguity regarding which team to associate with at least one person in an employee dimension of an organizational chart.
27. The computer-implemented method of claim 26, further comprising reporting at least one team metric for contributions of individual team members based on a disambiguated relationship between fine grain values at the person grain and coarse grain values at the team grain.
28. A system, comprising:
at least one computing device having computer program instructions stored on a computer readable medium which are operable to cause the at least one computing device to:
issue a report query for a dimensional data model having a plurality of dimension coordinates that exist within the dimensional data model, wherein each of the plurality of dimension coordinates has a second particular grain (“finer grain”) that is finer than a first particular grain (“coarser grain”) and having a value at the finer grain (“finer grain value”) and at the coarser grain (“coarser grain value”), the report query specifying a dimension coordinate constraint to which the plurality of dimension coordinates correspond;
receive, in response to the report query, a report in view of the plurality of dimension coordinates and their associated coarser grain values, the report processed to:
for a temporal dimension having temporal characteristics of grain relationships including a subset of the plurality of dimension coordinates in which there is a time changing relationship of the grains over a time period of interest leading to a potential ambiguity as to what coarser grain value to associate with a finer grain value in attributing facts at the finer grain value to the coarser grain value,
wherein
each of the dimension coordinates of the subset is such that there is at least one other dimension coordinate of the subset having a value at the finer grain value that is the same as the finer grain value of that dimension coordinate; and
the at least one other dimension coordinate also has a value at the coarser grain value that is different from the coarser grain value of that dimension coordinate,
the report processed to determine the coarser grain value to associate with all dimension coordinates of the subset having that finer grain value, wherein the determined coarser grain value is the coarser grain value of one of the dimension coordinates, of the subset, having that finer grain value such that the relationship is disambiguated by applying a disambiguation criterion to determine the association between coarser grain values when the finer grain value is associated with at least two different coarser grain values during the time period of interest;
wherein, for each of the dimension coordinates of the plurality of dimension coordinates not in the subset, the coarser grain value associated with that dimension coordinate is the coarser grain value of that dimension coordinate.
29. The system of claim 28, wherein the fine grain value corresponding to a value at a person grain and the coarse grain value corresponds to a value at a team grain, wherein the disambiguation resolves an ambiguity regarding which team to associate with at least one person in an employee dimension of an organizational chart.
30. The system of claim 29, further comprising reporting at least one team metric for contributions of individual team members based on a disambiguated relationship between fine grain values at the person grain and coarse grain values at the team grain.
US13/217,206 2006-12-22 2011-08-24 Disambiguation with respect to multi-grained dimension coordinates Abandoned US20110307512A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/217,206 US20110307512A1 (en) 2006-12-22 2011-08-24 Disambiguation with respect to multi-grained dimension coordinates

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/615,694 US8036859B2 (en) 2006-12-22 2006-12-22 Disambiguation with respect to multi-grained dimension coordinates
US13/217,206 US20110307512A1 (en) 2006-12-22 2011-08-24 Disambiguation with respect to multi-grained dimension coordinates

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/615,694 Continuation US8036859B2 (en) 2006-12-22 2006-12-22 Disambiguation with respect to multi-grained dimension coordinates

Publications (1)

Publication Number Publication Date
US20110307512A1 (en) 2011-12-15

Family

ID=39367001

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/615,694 Active 2028-09-13 US8036859B2 (en) 2006-12-22 2006-12-22 Disambiguation with respect to multi-grained dimension coordinates
US13/217,206 Abandoned US20110307512A1 (en) 2006-12-22 2011-08-24 Disambiguation with respect to multi-grained dimension coordinates

Country Status (2)

Country Link
US (2) US8036859B2 (en)
WO (1) WO2008079675A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8051075B2 (en) * 2007-09-24 2011-11-01 Merced Systems, Inc. Temporally-aware evaluative score
CN102415243B (en) * 2011-10-04 2013-10-30 吉林大学 Discrete-element-method-based corn threshing process analysis method

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070233651A1 (en) * 2006-03-31 2007-10-04 International Business Machines Corporation Online analytic processing in the presence of uncertainties

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5832496A (en) * 1995-10-12 1998-11-03 Ncr Corporation System and method for performing intelligent analysis of a computer database
JP3952518B2 (en) * 1996-03-29 2007-08-01 株式会社日立製作所 Multidimensional data processing method
GB2336007B (en) * 1998-04-01 2003-01-29 Mitel Corp Agent-based data mining and warehousing
US6735593B1 (en) * 1998-11-12 2004-05-11 Simon Guy Williams Systems and methods for storing data
US6763353B2 (en) * 1998-12-07 2004-07-13 Vitria Technology, Inc. Real time business process analysis method and apparatus
US6356900B1 (en) * 1999-12-30 2002-03-12 Decode Genetics Ehf Online modifications of relations in multidimensional processing
US6434557B1 (en) * 1999-12-30 2002-08-13 Decode Genetics Ehf. Online syntheses programming technique
US6831668B2 (en) * 2000-04-03 2004-12-14 Business Objects, S.A. Analytical reporting on top of multidimensional data model
US7451389B2 (en) * 2000-06-06 2008-11-11 Microsoft Corporation Method and system for semantically labeling data and providing actions based on semantically labeled data
US20020099563A1 (en) * 2001-01-19 2002-07-25 Michael Adendorff Data warehouse system
US6826568B2 (en) * 2001-12-20 2004-11-30 Microsoft Corporation Methods and system for model matching
CA2418753A1 (en) * 2002-02-12 2003-08-12 Cognos Incorporated Method and system for database join disambiguation
US7152073B2 (en) * 2003-01-30 2006-12-19 Decode Genetics Ehf. Method and system for defining sets by querying relational data using a set definition language
CA2419502A1 (en) 2003-02-21 2004-08-21 Cognos Incorporated Time-based partitioned cube
US7653528B2 (en) * 2005-03-08 2010-01-26 Microsoft Corporation Resource authoring incorporating ontology
US7685119B2 (en) * 2006-12-20 2010-03-23 Yahoo! Inc. System and method for query expansion

Also Published As

Publication number Publication date
WO2008079675A1 (en) 2008-07-03
US8036859B2 (en) 2011-10-11
US20080154556A1 (en) 2008-06-26

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION