HK40010382B - Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities - Google Patents
Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities Download PDFInfo
- Publication number
- HK40010382B HK40010382B HK42019000107.3A HK42019000107A HK40010382B HK 40010382 B HK40010382 B HK 40010382B HK 42019000107 A HK42019000107 A HK 42019000107A HK 40010382 B HK40010382 B HK 40010382B
- Authority
- HK
- Hong Kong
- Prior art keywords
- gsdf
- values
- digital code
- display
- image data
- Prior art date
Links
Description
This application claims priority to  U.S. Provisional Patent Application No. 61/567,579 filed 6 December 2011   ;  U.S. Provisional Patent Application No. 61/674,503 filed 23 July 2012   ; and  U.S. Provisional Patent Application No. 61/703,449 filed 20 September 2012   .
    This application is a European divisional application of European patent application  EP 16166823.1    (reference: D11082EP02), for which EPO Form 1001 was filed 25 April 2016.  EP 16166823.1    is itself a European divisional application of Euro-PCT patent application  EP 12808594.1    (reference: D11082EP01), filed 06 December 2012 and granted as  EP 2 788 855 on 01 June 2016  .
    The present invention relates generally to image data. More particularly, an embodiment of the present invention relates to perceptual nonlinearity-based image data exchange across different display capabilities.
    Advances in technology allow modern display designs to render image and video content with significant improvements in various quality characteristics over the same content, as rendered on less modem displays. For example, some more modern displays are capable of rendering content with a dynamic range (DR) that is higher than the standard dynamic range (SDR) of conventional or standard displays.
    For example, some modem liquid crystal displays (LCDs) have a light unit (a backlight unit, a side light unit, etc.) that provides a light field in which individual portions may be modulated separately from modulation of the liquid crystal alignment states of the active LCD elements. This dual modulation approach is extensible (e.g., to N-modulation layers wherein N comprises an integer greater than two), such as with controllable intervening layers (e.g., multiple layers of individually controllable LCD layers) in an electro-optical configuration of a display.
    In contrast, some existing displays have a significantly narrower dynamic range (DR) than high dynamic range (HDR). Mobile devices, computer pads, game devices, television (TV) and computer monitor apparatus that use typical cathode ray tube (CRT), liquid crystal display (LCD) with constant fluorescent white back lighting or plasma screen technology may be constrained in their DR rendering capability to approximately three orders of magnitude. Such existing displays thus typify a standard dynamic range (SDR), sometimes also referred to as "'low' dynamic range" or "LDR," in relation to HDR.
    Images captured by HDR cameras may have a scene-referred HDR that is significantly greater than dynamic ranges of most if not all display devices. Scene-referred HDR images may comprise large amounts of data, and may be converted into post-production formats (e.g., HDMI video signals with 8 bit RGB, YCbCr, or deep color options; 1.5 Gbps SDI video signals with a 10 bit 4:2:2 sampling rate; 3 Gbps SDI with a 12 bit 4:4:4 or 10 bit 4:2:2 sampling rate; and other video or image formats) for facilitating transmission and storage. Post-production images may comprise a much smaller dynamic range than that of scene-referred HDR images. Furthermore, as images are delivered to end users' display devices for rendering, device-specific and/or manufacturer-specific image transformations occur along the way, causing large amounts of visually noticeable errors in rendered images in comparison with the original scene-referred HDR images.
    The European search report cites  WO 2007/014681 A1   ("D1") and  EP 1 391 865 A1   ("D2").
    D1 describes a method to calibrate a display system such that the display system is conforming to an enforced standard for a wider range of parameters, e.g. viewing angles, than compared to traditional calibration methods. This is obtained by calculating an optimised set of calibration parameters for the display to be conform to the enforced standard for the selected range of parameters.
    D2 describes a device and method for processing video picture data for display on a display device having a plurality of luminous elements corresponding to pixels of a video picture, wherein the brightness of each pixel is controlled by sub-field code words corresponding to a number of impulses for switching on and off the luminous elements, by dithering said video picture data and sub-field coding the dithered video picture data for displaying. The brightness values of the video picture data are mapped to brightness levels taking the human contrast sensitivity into account and are based on the Weber-Fechner-law.
    The approaches described in this section are approaches that could be pursued, but not necessarily approaches that have been previously conceived or pursued. Therefore, unless otherwise indicated, it should not be assumed that any of the approaches described in this section qualify as prior art merely by virtue of their inclusion in this section. Similarly, issues identified with respect to one or more approaches should not assume to have been recognized in any prior art on the basis of this section, unless otherwise indicated.
    The invention is defined by the independent claims, taking due account of any element which is equivalent to an element specified in the claims. The dependent claims concern optional features of some embodiment of the invention.
    Merely the embodiments related to equation (10) having the exemplary constants n, m, c1, c2 and c3 as defined in the independent claims represent embodiments of the presently claimed invention. All other occurrence of the word "embodiment(s)" refer to examples, which were originally filed but which do not represent embodiments of the presently claimed invention; these examples are still shown for illustrative purpose only.
    The present invention is illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which like reference numerals refer to similar elements and in which:
 - FIG. 1 illustrates an example family of contrast sensitivity function curves that span across a plurality of light adaptation levels, according to an example embodiment of the present invention;
- FIG. 2 illustrates an example integration path, according to an example embodiment of the present invention;
- FIG. 3 illustrates an example gray scale display function, in accordance with an example embodiment of the present invention;
- FIG. 4 illustrates a curve depicting Weber fractions, according to an example embodiment of the present invention;
- FIG. 5 illustrates an example framework of exchange image data with devices of different GSDFs, according to an example embodiment of the present invention;
- FIG. 6 illustrates an example conversion unit, according to an example embodiment of the present invention;
- FIG. 7 illustrate an example SDR display, according to an example embodiment of the present invention;
- FIG. 8A and FIG. 8B illustrate example process flows, according to an example embodiment of the present invention;
- FIG. 9 illustrates an example hardware platform on which a computer or a computing device as described herein may be implemented, according an example embodiment of the present invention;
- FIG. 10A illustrates maximums for code errors in units of JNDs in a plurality of code spaces each with a different one of one of one or more different bit lengths, according to some example embodiments;
- FIG. 10B through FIG. 10E illustrate distributions of code errors, according to some example embodiments; and
- FIG. 11 illustrates values of parameters in a functional model, according to an example embodiment.
Example embodiments, which relate to perceptual luminance nonlinearity-based image data exchange across displays of different capabilities, are described herein. In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, that the present invention may be practiced without these specific details. In other instances, well-known structures and devices are not described in exhaustive detail, in order to avoid unnecessarily occluding, obscuring, or obfuscating the present invention.
    Example embodiments are described herein according to the following outline:
 - 1. GENERAL OVERVIEW
- 2. CONTRAST SENSITIVITY FUNCTION (CSF) MODEL
- 3. PERCEPTUAL NONLINEARITY
- 4. DIGITAL CODE VALUES AND GRAY LEVELS
- 5. MODEL PARAMETERS
- 6. VARIABLE SPATIAL FREQUENCIES
- 7. FUNCTIONAL MODELS
- 8. EXCHANGE IMAGE DATA BASED ON REFERENCE GSDF
- 9. CONVERTING REFERENCE ENCODED IMAGE DATA
- 10. EXAMPLE PROCESS FLOWS
- 11. IMPLEMENTATION MECHANISMS - HARDWARE OVERVIEW
- 12. ENUMERATED EXAMPLE EMBODIMENTS, EQUIVALENTS, EXTENSIONS, ALTERNATIVES AND MISCELLANEOUS
This overview presents a basic description of some aspects of an embodiment of the present invention. It should be noted that this overview is not an extensive or exhaustive summary of aspects of the embodiment. Moreover, it should be noted that this overview is not intended to be understood as identifying any particularly significant aspects or elements of the embodiment, nor as delineating any scope of the embodiment in particular, nor the invention in general. This overview merely presents some concepts that relate to the example embodiment in a condensed and simplified format, and should be understood as merely a conceptual prelude to a more detailed description of example embodiments that follows below.
    Human vision may not perceive a difference between two luminance values if the two luminance values are not sufficiently different from each other. Instead, human vision only perceives a difference if the luminance value differ no less than a just noticeable difference (JND). Due to perceptual nonlinearity of human vision, the amounts of individual JNDs are not uniformly sized or scaled across a range of light levels, but rather vary with different individual light levels. In addition, due to the perceptual nonlinearity, the amounts of individual JNDs are not uniformly sized or scaled across a range of spatial frequencies at a particular light level, but rather vary with different spatial frequencies below a cut-off spatial frequency.
    Encoded image data with luminance quantization steps of equal sizes or linearly scaled sizes does not match with perceptual nonlinearity of human vision. Encoded image data with luminance quantization steps at a fixed spatial frequency also does not match with perceptual nonlinearity of human vision. Under these techniques, when code words are assigned to represent quantized luminance values, too many code words may be distributed in a particular region (e.g., the bright region) of the range of light levels, while too few code words maybe distributed in a different region (e.g., the dark region) of the range of light levels.
    In the overpopulated region, a multitude of code words may not produce perceptual differences, and are, for all practical purposes, therefore wasted. In the underpopulated region, two adjacent code words may produce a perceptual difference much greater than a JND, and possibly produces contour distortion (also known as banding) visual artifacts.
    Under techniques as described herein, a contrast sensitivity function (CSF) model may be used to determine JNDs across a wide range (e.g., 0 to 12,000 cd/m2) of light levels. In an example embodiment, the peak JND as a function of spatial frequency at a particular light level is selected to represent a quantum of human perception at the particular light level. The selection of peak JNDs is in accord with the behaviors of human vision that adapts to a heightened level of visual perceptibility when a background of close but different luminance values is being viewed, which is sometimes referred to in video and image display fields as a crispening effect and/or Whittle's Crispening effect and may be described herein as such. As used herein, the term "a light adaption level" may be used to refer to a light level at which a (e.g., peak) JND is selected/determined, assuming that human vision is adapted to the light level. Peak JNDs as described herein vary over spatial frequency at different light adaptation levels.
    As used herein, the term "spatial frequency" may refer to a rate of spatial modulation/variation in images (wherein rate is computed in relation to or over spatial distance, in contrast to computing rate in relation to time). In contrast to conventional approaches that may fix spatial frequency at a specific value, the spatial frequency as described herein may vary, for example, in or over a range. In some embodiments, peak JNDs maybe limited within a particular spatial frequency range (e.g., 0.1 to 5.0, 0.01 to 8.0 cycles/degrees, or a smaller or larger range).
    A reference gray scale display function (GSDF) may be generated based on the CSF model. In some embodiments, a very wide field of view is assumed for the CSF model for generating a reference GSDF that better support entertainment display fields. The GSDF refers to a set of reference digital code values (or reference code words), a set of reference gray levels (or reference luminance values), and a mapping between the two sets. In an example embodiment, each reference digital code value corresponds to a quantum of human perception, as represented by a JND (e.g., a peak JND at a light adaptation level). In an example embodiment, an equal number of reference digital code values may correspond to a quantum of human perception.
    The GSDF may be obtained by accumulating JNDs from an initial value. In an example embodiment, a middle code word value (e.g., 2048 for a 12-bit code space) is given as an initial value to a reference digital code. The initial value of the reference digital code may correspond to an initial reference gray level (e.g., 100 cd/m2). Other reference gray levels for other values of the reference digital code may be obtained by positively accumulating (adding) JNDs as the reference digital code is incremented one by one, and by negatively accumulating (subtracting) JNDs as the reference digital code is decremented one by one. In an example embodiment, quantities such as contrast thresholds may be used in computing reference values in the GSDF, in place of JNDs. These quantities actually used in computation of a GSDF may be defined as unitless ratios and may differ from corresponding JNDs only by known or determinable multipliers, dividing factors and/or offsets.
    A code space may be selected to include all reference digital code values in the GSDF. In some embodiments, the code space in which all the reference digital code values reside may be one of a 10-bit code space, an 11-bit code space, a 12-bit code space, a 13-bit code space, a 14-bit code space, a 15-bit code space, or a larger or smaller code space.
    While a large code space (>15 bits) may be used to host all reference digital code values, in a particular embodiment, the most efficient code space (e.g., 10 bits, 12 bits, etc.) is used to host all reference digital code values generated in a reference GSDF.
    The reference GSDF may be used to encode image data, for example, captured or generated by HDR cameras, studio systems, or other systems with a scene-referred HDR that is significantly greater than dynamic ranges of most if not all display devices. The encoded image data may be provided to downstream devices in a wide variety of distribution or transmission methods (e.g., HDMI video signals with 8 bit RGB, YCbCr, or deep color options; 1.5 Gbps SDI video signals with a 10 bit 4:2:2 sampling rate; 3 Gbps SDI with a 12 bit 4:4:4 or 10 bit 4:2:2 sampling rate; and other video or image formats).
    In some embodiments, because adjacent reference digital code values in the reference GSDF correspond to gray levels that are within a JND, details for which human vision is capable of distinguishing may be completely or substantially preserved in the image data encoded based on the reference GSDF. A display that fully supports the reference GSDF may possibly render images with no banding or contour distortion artifacts.
    Image data encoded based on the reference GSDF (or reference encoded image data) may be used to support a wide variety of less capable displays that may not fully support all reference luminance values in the reference GSDF. Because the reference encoded image data comprises all the perceptual details in the supported luminance range (which may be designed to be a superset of what displays support), reference digital code values may be optimally and efficiently transcoded to display-specific digital code values in a way to preserve as much details as a specific display is capable of supporting and to cause as few visually noticeable errors as possible. Additionally and/or optionally, decontouring and dithering may be performed in conjunction with, or as a part of, transcoding from reference digital code values to display-specific digital code values to further improve image or video quality.
    Techniques as described herein are not color-space dependent. They may be used in a RGB color space, a YCbCr color space, or a different color space. Furthermore, techniques that derive reference values (e.g., reference digital code values and reference gray levels) using JNDs which vary with spatial frequency may be applied to a different channel (e.g., one of red, green, and blue channels) other than a luminance channel in a different color space (e.g., RGB) which may or may not comprise a luminance channel. For example, reference blue values may be derived in place of reference gray levels using JNDs which are applicable to the blue color channel. Thus, in some embodiments, gray scale may be substituted for color. Additionally and/or optionally, different CSF models may also be used instead of Barten's model. So may different model parameters be used for the same CSF model.
    In some embodiments, mechanisms as described herein form a part of a media processing system, including, but not limited to: a handheld device, game machine, television, laptop computer, netbook computer, cellular radiotelephone, electronic book reader, point of sale terminal, desktop computer, computer workstation, computer kiosk, or various other kinds of terminals and media processing units.
    Various modifications to the preferred embodiments and the generic principles and features described herein will be readily apparent to those skilled in the art. Thus, the disclosure is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features described herein.
    Human visual sensitivity for spatial structures in rendered images may be best described with contrast sensitivity functions (CSFs), which describe contrast sensitivity as functions of spatial frequency (or rate of spatial modulation/variation in images as perceived by a human observer). As used herein, contrast sensitivity, S, may be considered as a gain in human visual neural signal processing, while contrast thresholds, CT, may be determined from the inverse of the contrast sensitivity, for example:   
    As used herein, the term "contrast threshold" may refer to, or relate to, a lowest value (e.g., a just-noticeable difference) of (relative) contrast necessary for human eyes to perceive a difference in contrast; in some embodiments, contrast thresholds may also be depicted as a function of the just-noticeable difference divided by the light adaptation level across a range of luminance values.
    In some embodiments, contrast thresholds may be directly measured in experiments, without use of any CSF model. In some other embodiments, however, contrast thresholds may be determined based on a CSF model. A CSF model may be built with a number of model parameters and maybe used to derive a GSDF whose quantization steps in gray levels depend on and vary with light levels characterized by luminance values and spatial frequency. An example embodiment may be implemented based on one or more of a variety of CSF models such as those described in Peter G. J. Barten, Contrast Sensitivity of the Human Eye and its Effects on Image Quality (1999) (herein after Barten's model or Barten's CSF model), or Scott Daly, Chapter 17 in Digital Images and Human Vision, ed., by A. B. Watson, MIT Press (1993) (hereinafter Daly's model). In relation to example embodiments of the present invention, contrast thresholds used to generate a reference grayscale display function (GSDF) may be derived experimentally, theoretically, with a CSF model, or a combination thereof.
    As used herein, a GSDF may refer to a mapping of a plurality of digital code values (e.g., 1, 2, 3, ... , N) to a plurality of gray levels (L1, L2, L3, ..., LN), in which the digital code values represent index values of contrast thresholds and the gray levels correspond to the contrast thresholds, as shown in TABLE 1.
 TABLE 1  
     
    | 1 | |
| 2 | |
| 3 | |
| ... | ... | 
| i-1 | |
| I | |
| i+1 | |
| ... | ... | 
| N | 
In an embodiment, a gray level (e.g., Li) corresponding to a digital code value (e.g., i) and an adjacent gray level (e.g., Li+1) may be computed in relation to a contrast (e.g., C(i)) as follows:    wherein C(i) represents a contrast for a luminance range that is bounded between Li and Li+1. Lmean(i, i+1) comprise an arithmetic average or mean of the two adjacent gray levels Li and Li+1. The contrast C(i) is arithmetically related to a Weber fraction ΔL/L by a factor of two. Here, ΔL represents (Li+1 - Li), and L represents one of Li, Li+1, or an intermediate value between Li and Li+1.
    In some embodiments, a GSDF generator may set the contrast C(i) to a value equal, or otherwise proportional, to a contrast threshold (e.g., CT(i)) at a luminance level L between Li and Li+1, inclusive, as follows:    wherein k represents a multiplicative constant. In relation to embodiments of the present invention, other descriptive statistics/definitions (e.g., geometric mean, medium, mode, variance, or standard deviation) and/or scaling (x2, x3, divided or multiplied by a scaling factor, etc.) and/or offsetting (+1, +2, -1, -2, subtracted or added by an offset, etc.) and/or weighting (e.g., assigning the two adjacent gray levels with the same or different weight factors) may be used to relate contrast thresholds with contrasts for the purpose of computing gray levels in a GSDF.
    As computed in expressions (1), (2) and (3), contrasts or contrast thresholds may comprise a relative value, and may thus comprise a unitless quantity (e.g., so S may also be unitless).
    A CSF model may be built up from basic contrast threshold measurements or computations based on a CSF that depicts the CSF model. Human vision is unfortunately complex, adaptive and nonlinear, so that the there is no single CSF curve that describes the human vision. Instead, a family of CSF curves may be generated based on a CSF model. Even with the same CSF model, different values of model parameters produce different plots for the family of CSF curves.
    In an example embodiment, a CSF function that gives rise to a family of CSF curves as illustrated in FIG. 1  maybe derived with the Barten's CSF model, which takes into account a number of key effects relating to human perception. An example CSF, S(u), (or the inverse of the corresponding contrast threshold, mt,) under the Barten's CSF model may be computed as shown in Expression (4), below.    
    The example model parameters used in expression (4) above comprise the representations listed below:
 - ∘ 2 (the numeric factor) corresponds to binocular vision (4 if monocular);
- ∘ k represents a signal/noise ratio, for example, 3.0;
- ∘ T represents an integration time of the eye, for example, 0.1 second;
- ∘ X0 represents an angular size of object (e.g., in a square shape);
- ∘ Xmax represents a maximum angular size of the integration area of the eye (e.g., 12degrees);
- ∘ Nmax represents a maximum number of cycles that are accumulated via probability summation, e.g., 15 cycles;
- ∘ η represents a quantum efficiency of the eye, e.g., .03;
- ∘ p represents a photon conversion factor;
- ∘ E represents a retinal illuminance, for example, in Troland units;
- ∘ Φ0 represents a spectral density of neural noise, e.g., 3x10-8 second ∗ degrees2; and
- ∘ u0 represents a maximum spatial frequency for lateral inhibition, e.g., 7 cycles/degrees.
The optical modulation transfer function, Mopt, may be given as follows:    where σ represents a model parameter related to pupil and/or light level.
    Barten's CSF model as discussed above may be used to describe perceptual nonlinearity relative to luminance. Other CSF models may also be used to describe perceptual nonlinearity. For example, Barten's CSF model does not account for the effect of accommodation, which causes a lowering of the cut-off spatial frequency in the high spatial frequency region of the CSF. This lowering effect due to accommodation may be expressed as a function of decreasing viewing distance.
    For example, for viewing distances over 1.5 meters, the maximum cutoff spatial frequency as depicted by Barten's CSF model may be achieved, without affecting the effectiveness of Barten's model as an appropriate model to describe perceptual nonlinearity. However, for distances less than 1.5 meters, the effect of accommodation starts to become significant, reducing the accuracy of Barten's model.
    Thus, for tablet displays, which have closer viewing distances, such as 0.5 meter, and smartphones, which can have viewing distances as close as 0.125 meter, Barten's CSF model may not be optimally tuned.
    In some embodiments, Daly's CSF model, which takes into account the accommodation effect, may be used. In a particular embodiment, Daly's CSF model may be constructed in part based on Barten's CSF, S(u), in expression (4) above, for example, by modifying the optical modulation transfer function, Mopt, in expression (5).
    A GSDF as illustrated in TABLE 1 maps perceptual nonlinearity using the digital code values to represent gray levels tied to contrast thresholds in human vision. The gray levels which comprise all the mapped luminance values may be distributed in such a way that they are optimally spaced to match the perceptual nonlinearity of human vision.
    In some embodiments, when the maximum number of gray levels in a GSDF are sufficiently large relative to the maximum range of luminance values, digital code values in the GSDF may be used in a way to achieve the lowest number (e.g., below a total of 4096 digital code values) of gray levels without causing the visibility of the gray level step transition (e.g., visible as a false contour or band in an image; or a color shift in dark regions of an image).
    In some other embodiments, a limited number of digital code values may still be used to represent a wide dynamic range of gray levels. For example, when the maximum number of grayscale levels in a GSDF are not sufficiently large relative to the maximum range of the grayscale levels (e.g., digital code values in an 8-bit representation with the range of grayscale levels from 0 to 12,000 nits), the GSDF may still be used in a way to achieve the lowest number (e.g., below a total of 256 digital code values) of gray levels to reduce or minimize the visibility of the gray level step transition. With such a GSDF, amounts/degrees of perceptible errors/artifacts of the step transition may be evenly distributed throughout the hierarchy of a relatively low number of gray levels in the GSDF. As used herein, the term "grayscale level" or "gray level" may be used interchangeably, and may refer to a represented luminance value (a quantized luminance value represented in a GSDF).
    Gray levels in a GSDF may be derived by stacking or integrating contrast thresholds across light adaptation levels (at different luminance values). In some embodiments, quantization steps between gray levels may be so chosen that a quantization step between any two adjacent gray levels lands within a JND. A contrast threshold at a particular light adaptation level (or luminance value) may be no more than the just-noticeable difference (JND) at that particular adaptation level. Gray levels may be derived by integrating or stacking fractions of contrast thresholds (or JNDs). In some embodiments, the number of digital code values is more than sufficient to represent all the JNDs in the represented dynamic range of luminance.
    Contrast thresholds, or inversely contrast sensitivities, that are used to compute grayscale levels may be selected from a CSF curve at a different spatial frequency other than a fixed spatial frequency for a particular light adaptation level (or luminance value). In some embodiments, each of the contrast thresholds is selected from a CSF curve at a spatial frequency that corresponds to a peak contrast sensitivity (e.g., due to Whittle's crispening effect) for a light adaptation level. In addition, contrast thresholds may be selected from CSF curves at different spatial frequencies for different light adaptation levels.
    An example expression to compute/stack the gray levels in the GSDF is as follows:    where f represents the spatial frequency, which may be other than a fixed number under techniques as described herein; and LA  represents the light adaptation level. Lmin  may be the lowest luminance value in all the mapped gray levels. As used herein, the term "Nit" or its abbreviation "nt" may relate or refer, synonymously or interchangeably, to a unit of image intensity, brightness, luma and/or luminance that is equivalent or equal to one (1) candela per square meter (1 Nit = 1 nt = 1 cd/m2). In some embodiments, Lmin  may comprise a value of zero. In some other embodiments, Lmin  may comprise a non-zero value (e.g., a certain dark black level, 10-5 nit, 10-7 nit, etc., which may be lower than what display devices are generally able to achieve). In some embodiments, Lmin  may be replaced with other than a minimum initial value, such as an intermediate value, or a maximum value, which allows stacking computations with subtraction or negative addition.
    In some embodiments, stacking of the JNDs to derive gray levels in a GSDF is performed by summation, for example, as shown in expression (6). In some other embodiments, an integral may be used in place of the discrete summation. The integral may integrate along an integration path determined from a CSF (e.g., expression (4)). For example, the integration path may comprise peak contrast sensitivities (e.g., different peak sensitivities corresponding to different spatial frequencies) for all light adaptation levels in a (reference) dynamic range for the CSF.
    As used herein, an integration path may refer to a visible dynamic range (VDR) curve used to represent human perceptual nonlinearity and to establish a mapping between a set of digital code values and a set of reference gray levels (quantized luminance values). The mapping may be required to meet the criteria that each quantization step (e.g., the luminance difference of two adjacent gray levels in TABLE 1) be less than the JNDs above or below a corresponding light adaptation level (luminance value). The instantaneous derivative (in units of nit/spatial-cycle) of the integration path at a particular light adaptation level (luminance value) is proportional to the JND at the particular adaptation level. As used herein, the term "VDR" or "visual dynamic range" may refer a dynamic range wider than a standard dynamic range, and may include, but is not limited to, a wide dynamic range up to the instantaneously perceivable dynamic range and color gamut which human vision can perceive at an instant.
    Based on techniques as described herein, a reference GSDF that is independent of any specific displays or image processing devices may be developed. In some embodiments, one or more model parameters other than light adaptation level (luminance), spatial frequency, and angular size may be set to constant (or fixed) values.
    In some embodiments, the CSF model is constructed with conservative model parameter values that cover a broad range of display devices. The use of the conservative model parameter values provides smaller JNDs than existing standard GSDFs. Accordingly, in some embodiments, the reference GSDF under the techniques described herein is capable of supporting luminance values with a high precision that exceeds the requirements of these display devices.
    In some embodiments, model parameters as described herein include a field-of-vision (FOV) parameter. The FOV parameter may be set to a value of 45 degrees, 40 degrees, 35 degrees, 30 degrees, 25 degrees, or another larger or smaller value that supports a wide range of display devices and viewing scenarios including those used in studios, theaters or high-end entertainment systems.
    Model parameters as described herein may include an angular size parameter, which may relate for example to a field of view. The angular size parameter may be set to a value of 45 degrees x 45 degrees, 40 degrees x 40 degrees, 35 degrees x 35 degrees, 30 degrees x 30 degrees, 25 degrees x 25 degrees, or another larger or smaller value that supports a wide range of display devices and viewing scenarios. In some embodiments, the angular size parameter used in part to derive the reference GSDF is set to n degrees x m degrees, where either of n and m may be a numeric value between 30 and 40, and n and m may or may not be equal.
    In some embodiments, a larger angular size (e.g., 40 degrees x 40 degrees) is used to produce a reference GSDF with a greater number of grayscale levels and thus more contrast sensitivity. The GSDF may be used to support a wide range of viewing and/or displaying scenarios (e.g., large screen video displays) which may require a wide viewing angle of ∼30 to 40 degrees. The GSDF having an increased sensitivity due to the selection of large angular size may also be used to support highly variable viewing and/or displaying scenarios (e.g., cinemas). It is possible to select even larger angular sizes; however, raising the angular size significantly above a certain angular size (e.g., 40 degrees) may produce relatively limited marginal benefits.
    In some embodiments, a reference GSDF model covers a large luminance range. For example, gray levels, or quantized luminance values, that are represented by the reference GSDF model ranges from 0 or approximately 0 (e.g., 10-7 cd/m2) to 12,000 cd/m2. The lower bound of the represented luminance values in the reference GSDF model maybe 10-7 cd/m2, or a lower or higher value (e.g., 0, 10-5, 10-8 , 10-9 cd/m2, etc.). The GSDF may be used to support a wide range of viewing and/or displaying scenarios with different ambient light levels. The GSDF may be used to support a wide range of display devices with different dark black levels (in theaters, indoor, or outdoor).
    The upper bound of the represented luminance values in the reference GSDF model may be 12,000 cd/m2, or a lower or higher value (e.g., 6000-8000, 8000-10000, 10000-12000, 12000-15000 cd/m2, etc.). The GSDF may be used to support a wide range of viewing and/or displaying scenarios with high dynamic ranges. The GSDF may be used to support a wide range of display devices with different maximum luminance levels (HDR TVs, SDR displays, laptops, tablets, handheld devices, etc.).
    As illustrated in FIG. 2 , peak contrast sensitivities do not occur at a fixed spatial frequency value, but rather occur at smaller spatial frequencies as light adaptation levels (luminance values) drop. This means that techniques (e.g., DICOM) with a fixed spatial frequency may significantly underestimate contrast sensitivities of human vision for the dark light adaptation levels (low luminance values). Lower contrast sensitivities give rise to higher contrast thresholds, resulting in larger quantization step sizes in quantized luminance values.
    Unlike the Digital Imaging and Communications in Medicine (DICOM) standard, a VDR curve under techniques as described herein does not fix the spatial frequency model parameter at a fixed value such as 4 cycles per degree. Rather, the VDR curve varies with the spatial frequency and accurately captures the peak contrast sensitivities of human vision at a plurality of light adaptation levels. The VDR curve properly takes into account the crispening effect due to human vision's adaptability to a wide range of light adaptation levels, and helps generate a high-precision reference GSDF. Here, the term "high-precision" means that perceptual errors due to quantization of luminance values are removed or substantially reduced based on a reference GSDF that best and most efficiently captures human visual nonlinearity within the constraint of a fixed size code space (e.g., one of 10 bits, 12 bits, etc.).
    A computation process may be used to calculate gray levels in the reference GSDF (e.g., TABLE 1). In an example embodiment, the computation process is iterative or recursive, repeatedly determines the contrast thresholds (or modulation threshold, for example, mt  in expression 4) from the VDR curve, and applies the contrast thresholds to obtain successive gray levels in the reference GSDF. This computation process may be implemented with the following expressions (7):    where j-1,j and j+1 represent indexes to three adjacent digital code values; Lj-1, Lj  and L  j+1  correspond to gray levels to which the digital code values j-1, j and j+1 are mapped, respectively. Lmax  and Lmin  respectfully represent maximum luminance value and minimum luminance value across a JND or a fraction of a JND. Using a JND or a fraction thereof maintains the high precision of the reference GSDF.
    The contrast threshold mt  associated with the JND may be defined as a relative quantity, e.g., a difference between Lmax  and Lmin, as divided by a particular luminance value of either Lmax or Lmin, or in between L max and Lmin (e.g., an average of Lmax and Lmin). In some embodiments, mt  may be alternatively defined as the difference between Lmax  and Lmin, as divided by a multiplier (e.g., 2) of a particular luminance value of either Lmax  or Lmin, or in between Lmax  and Lmin. In quantizing luminance values in a GSDF to a plurality of gray levels, Lmax  and Lmin  may refer to adjacent gray levels in the plurality of gray levels. As a result, Lj  may be related to Lj-i  and Lj+1  through mt, respectively, as shown in expression (7).
    In alternative embodiments, instead of using linear expressions as illustrated in expression (7), non-linear expression may be used to relate JNDs or contrast thresholds with gray levels. For example, an alternative expression based on the standard deviation divided by the mean may be used in place of a simple ratio for the contrast threshold as illustrated.
    In some embodiments, a reference GSDF covers a range of 0 to 12,000 cd/m2 with digital code values represented as 12-bit integer value. To further improve the precision of the reference GSDF, mt  may be multiplied with a fraction value f. Furthermore, a center digital value L2048 (note that the digital code values are at least limited to 0 and 4096 as in a 12-bit code space that is compatible with SDI) may be mapped to 100cd/m2. Expression (7) may yield the following expressions (8):    wherein the fraction value f is set to 0.918177. In the example embodiment, the minimum allowed value for the digital codes is set to code word (or integer value) 16 is set to 0 (cd/m2). The second lowest digital code value 17 ends up at 5.27x10-7 cd/m2, while the digital code value 4076 ends up at 12,000 cd/m2.
    One or more analytical functions may be used to obtain a mapping between digital code values and gray levels in a GSDF (reference GSDF or device-specific GSDF) as described herein. The one or more analytical functions may be proprietary, standard-based, or extensions from standard-based functions. In some embodiments, a GSDF generator (e.g., 504 of FIG. 5 ) may generate a GSDF in the form of one or more forward look-up tables (LUTs) and/or one or more inverse LUTs based on the one or more analytical functions (or formulas). At least some of these LUTs may be provided to a variety of image data codecs (e.g., 506 of FIG. 5 ) or a wide variety of display device to be used in converting between reference gray levels and reference digital code levels for the purpose of encoding reference image data. Additionally, optionally, or alternatively, at least some of the analytical functions (with their coefficients in integer or floating point representations) may be directly provided to image data codecs or a wide variety of display device to be used in obtaining mappings between digital code values and gray levels in a GSDF as described herein and/or converting between gray levels and digital code levels for the purpose of encoding image data.
    In some embodiments, analytical functions as described herein comprise a forward function that may be used to predict a digital code value based on a corresponding gray level, as follows:    where D represents a (e.g., 12 bit) value of digital code, L represents a luminance value or gray level in nits, n may represent a slope in a middle section of a logD/logL curve as given by expression (9), m may represent the sharpness of the knee of the logD/logL curve, and c1,c2 and c3 may define the end and middle points of the logD/logL curve.
    In some embodiments, the analytical functions comprise an inverse function that corresponds to the forward function in expression (9) and may be used to predict a luminance value based on a corresponding digital code value, as follows:   
    Digital code values predicted based on a plurality of luminance values using expression (9) may be compared with observed digital code values. The observed digital code values may be, but are not limited only to any of, numeric calculation based on a CSF model as previously discussed. In an embodiment, a deviation between the predicted digital code values and the observed digital code values may be computed and minimized to derive optimal values of the parameters n, m, c1, c2, and c3  in expression (9).
    Likewise, luminance values predicted based on a plurality of digital code values using expression (10) may be compared with observed luminance values. The observed luminance values may, but are not limited to, be generated using numeric computations based on a CSF model as previously discussed, or using human visual experimentation data. In an embodiment, the deviation between the predicted luminance values and the observed luminance values may be derived as a function of the parameters n, m, c1, c2, and c3  and minimized to derive optimal values of the parameters n, m, c1, c2, and c3  in expression (10).
    A set of optimal values of the parameters n, m, c1, c2, and c3  as determined with expression (9) may or may not be the same as a set of optimal values of the parameters n, m, c1, c2, and c3  as determined with expression (10). In case of differences between the two sets, one or both of the two sets may be used to generate a mapping between digital code values and luminance values. In some embodiments, the two sets of optimal values of the parameters n, m, c1, c2, and c3 , if different, may be harmonized, for example, based on minimization of round trip errors, which are introduced by performing both forward and inverse coding operations with both expressions (9) and (10). In some embodiments, multiple round trips may be made to study resultant errors in digital code values and/or in luminance values or gray levels. In some embodiments, selection of the parameters in expressions (9) and (10) may be based at least in part on a criterion that no significant error occurs in one, two, or more round trips. Examples of no significant round trip errors may include, but are not limited only to any of, errors smaller than 0.0001%, 0.001%, 0.01%, 0.1%, 1%, 2%, or other configurable values.
    Embodiments include using a code space of one of one or more different bit lengths to represent digital control values. Optimized values of the parameters in expressions (9) and (10) may be obtained for each of a plurality of code spaces each with a different one of one of one or more different bit lengths. Based on the optimized values of expressions (9) and (10), distributions of code errors (e.g., forward transformation errors, inverse transformation errors or round trip errors in digital code values based on expressions (9) and (10)) maybe determined. In some embodiments, a numeric difference of one (1) in two digital code values corresponds to a contrast threshold (or corresponds to a JND) at a light level between two luminance values represented by the two digital code values. FIG. 10A  illustrates maximums for code errors in units of JNDs in a plurality of code spaces each with a different one of one of one or more different precisions (with different bit lengths), according to some example embodiments. For example, based on functional models as described herein, the maximum code error for a code space of infinite or unlimited bit length is 11.252. In comparison, based on a functional model as described herein, the maximum code error for a code space of a 12 bit length (or 4096) is 11.298. This indicates that a code space of a 12 bit length for digital code values is an excellent choice with a functional model as represented by expressions (9) and (10).
    Embodiments include using a functional model with formulas other than those (which may be tone-mapping curves) given in expressions (9) and (10). For example, a cone model with a Naka-Rushton formula as follows may be used by a functional model as described herein:    wherein L represent luminance values, n, m and σ represent model parameters in association with the cone model, and, Ld  represents predicted values that may be encoded with digital code values. Similar methods of obtaining model parameters through minimizing deviations may be used to derive optimal values of the model parameters for expression (11). FIG. 10D  illustrates a distribution of code errors for a code space of the 12 bit length (or 4096) with a forward transformation (from luminance values to digital code values) as specified by expression (11), according to an example embodiment. In an embodiment, the maximum code error as illustrated in FIG. 10D  is 25 JNDs.
    In another example, a functional model may be generated with a Raised mu formula as follows:    wherein x represents luminance values, and y represents predicted digital code values. An optimal value of the model parameter µ may be obtained through minimizing deviations. FIG. 10E  illustrates a distribution of code errors for a code space of the 12 bit length (or 4096) with a forward transformation (from luminance values to digital code values) as specified by expression (12), according to an example embodiment. In an embodiment, the maximum code error as illustrated in FIG. 10E  is 17 JNDs.
    As illustrated herein, in some embodiments, a functional model may be used to predict code values from luminance values or predict luminance values from code values. Formulas used by the functional model may be invertible. Same or similar processing logic may be implemented to perform forward and inverse transformation between these values. In some embodiments, model parameters including but not limited only to any of exponents may be represented by fixed-point values or integer-based formulas. Thus, at least a part of the processing logic may be efficiently implemented in hardware only, software only, or a combination of hardware and software. Similarly, at least a part of LUTs generated with the functional model or model formulas (such as expressions (9) through (12)) may be efficiently implemented in hardware only, software only, or a combination of hardware and software (including ASIC or FPGA). In some embodiments, one, two, or more functional models may be implemented in a single computing device, a configuration of multiple computing devices, a server, etc. In some embodiments, errors in predicted code values may be within 14 code values of target or observed values over a full range of visible dynamic range of luminance values. In some embodiments, this holds true for both forward and inverse transformations. Same or different sets of model parameters may be used in forward and inverse transformations. Round-trip accuracy may be maximized with optimal values of the model parameters. Different code spaces may be used. In particular embodiment a code space of 12 bit length (4096) may be used to host digital code values with minimal code errors across the full range of visible dynamic range.
    As used herein, a reference GSDF may refer to a GSDF comprising reference digital code values and reference gray levels as related under a functional model (the model parameters of which may be determined with target or observed values under a CSF model), as determined with numeric computations (e.g., without determining any functional representation of a mapping between digital code values and luminance values) based on a CSF model, or as determined with data from human visual studies. In some embodiments, a device GSDF may also comprise a mapping between digital code values and gray levels that may be analytically represented with a functional model as described herein.
    For the purpose of illustration, it has been described that digital code values reside in a 12 bit code space. The present invention, however, is not so limited. Digital code values with different code spaces (e.g., different bit depths other than 12 bits) maybe used in a reference GSDF. For example, 10 bit integer values may be used to represent digital codes. Instead of mapping a digital code value 4076 to a luminance value 12000 cd/m2 in a 12-bit representation of digital codes, a digital code value 1019 may be mapped to the luminance value 12000 cd/m2 in a 10-bit representation of digital codes. Thus, these and other variations in code spaces (bit depths) may be used for digital code values in a reference GSDF.
    The reference GSDF may be used to exchange image data across different GSDFs which may be individually designed for each type of image acquisition device or image rendering device. For example, a GSDF implemented with a specific type of image acquisition device or image rendering device may implicitly or explicitly depend on model parameters that do not match model parameters of a standard GSDF or a device-specific GSDF with another type of image acquisition device or image rendering device.
    A reference GSDF may correspond to curve shapes as depicted in FIG. 3  and FIG. 4 . Generally speaking, the shapes of GSDFs depend on parameters used to derive or design the GSDFs. Hence, a reference GSDF depends on a reference CSF model and reference model parameters used to generate the reference GSDF from the reference CSF model. The curve shape of a device-specific GSDF depends on the specific device, including display parameters and viewing conditions if the specific device is a display.
    In an example, a display whose supported range of luminance values is limited to less than 500 cd/m2 may not experience the increase in slope at a high luminance value region (which occurs when the human vision shifts to a logarithmic behavior for all frequencies) as shown in FIG. 3 . Driving the display with a curve shape of FIG. 3  may lead to nonoptimal (e.g., suboptimal) allocation of gray levels, with too many gray levels allocated in the bright regions, and not enough allocated in the dark regions.
    In another example, a low contrast display is designed to be used outdoors in various daylight conditions. The display's luminance range may occur largely or almost entirely in the log behavior region of FIG. 3 . Driving this low contrast display with a curve shape of FIG. 3  may also lead to nonoptimal (suboptimal) allocation of gray levels, with too many gray levels allocated in the dark regions, and not enough allocated in the bright regions.
    Under techniques as described herein, each display may use its specific GSDF (dependent on not only the display parameters, but also on the viewing conditions which, for example, affect the actual black level) to optimally support perceptual information in image data encoded with a reference GSDF. The reference GSDF is used by one or more upstream (e.g., encoding) devices for the overall encoding of image data to preserve perceptual details as much as possible. The image data encoded in the reference GSDF is then delivered to one or more downstream (e.g., decoding) devices. In an example embodiment, encoding of image data based on the reference GSDF is independent of specific devices that are to subsequently decode and/or render the image data.
    Each device (e.g., display) has its specific GSDF where device-specific gray levels are supported/optimized. The specific gray levels may be known to the maker of the display, or may have been specifically designed by the maker to support the device-specific GSDF (which may or may not be standard based). The line driver of the device may be implemented with quantized luminance values specific to the device. Optimization may be best done for the device based on the quantized luminance values specific to the device. Additionally, the dark black level (e.g., the lowest device-specific gray level), which may be used as the lower bound to the range of device-specific gray levels, may be set based in part on the present ambient light level and/or the device's optical reflectivity (which may be known to the maker). Once the dark black level is so set, device-specific gray levels may be obtained or set by implicitly or explicitly accumulating (e.g., stacking/integrating) quantization steps in the line driver of the device. The derivation and/or adjustment of gray levels may or may not be done at runtime when the device is concurrently rendering images.
    Thus, under techniques as described herein, embodiments of the present invention may include, but are not limited only to, encoding image data with a reference GSDF and decoding and rendering the image data with a display-specific GSDF.
    Techniques as described herein may be used to exchange image data across a variety of devices with different GSDFs. FIG. 5  illustrates an example framework (500) of exchange image data with devices of different GSDFs, in accordance with an example embodiment of the present invention. As illustrated in FIG. 5 , an adaptive CSF model (502) may be used to generate a reference GSDF (504). The term "adaptive" may refer to the adaptability of a CSF model to human visual nonlinearity and behaviors. The adaptive CSF model may be built based at least in part on a plurality of CSF parameters (or model parameters). The plurality of model parameters include, for example, light adaptation level, display area in degree width, noise level, accommodation (physical viewing distance), luminance or color modulation vector (which may be, for example, related to test images or image patterns used in the adaptive CSF model (502)).
    An upstream (e.g., encoding) device may receive image data to be encoded with the reference GSDF (504) before the image data or its derivative is transmitted or distributed to downstream (e.g., decoding) devices. The image data to be encoded may initially be in any of a plurality of formats (standard based, proprietary, extension thereof, etc.) and/or may be derived from any of a plurality of image sources (camera, image server, tangible media, etc.). Examples of image data to be encoded include, but are not limited only to, raw or other high bit-depth image(s) 530. The raw or other high bit-depth image(s) may come from a camera, a studio system, an art director system, another upstream image processing system, an image server, a content database, etc. The image data may include, but is not limited only to, that of digital photos, video image frames, 3D images, non-3D images, computer-generated graphics, etc. The image data may comprise scene-referred images, device-referred images, or images with various dynamic ranges. Examples of image data to be encoded may include a high-quality version of original images that are to be edited, down-sampled, and/or compressed, along with metadata, into a coded bitstream for distribution to image receiving systems (downstream image processing system such as displays of various makers). The raw or other high bit-depth image(s) may be of a high sampling rate used by a professional, an art studio, a broadcast company, a high-end media production entity, etc. Image data to be encoded may also be in whole or in part computer generated, or may even be obtained based in whole or in part from existing image sources such as old movies and documentaries.
    As used herein, the phrase "image data to be encoded" may refer to image data of one or more images; the image data to be encoded may comprise floating-point or fixed-point image data, and may be in any color space. In an example embodiment, the one or more images may in an RGB color space. In another example embodiment, the one or more images may be in a YUV color space. In an example, each pixel in an image as described herein comprises floating-point pixel values for all channels (e.g., red, green, and blue color channels in the RGB color space) defined in the color space. In another example, each pixel in an image as described herein comprises fixed-point pixel values for all channels (e.g., 16 bits or higher/lower numbers of bits fixed-point pixel values for red, green, and blue color channels in the RGB color space) defined in the color space. Each pixel may optionally and/or alternatively comprise downsampled pixel values for one or more of the channels in the color space.
    In some embodiments, in response to receiving the image data to be encoded, an upstream device in the framework (500) maps luminance values as specified by or determined from the image data to reference digital code values in the reference GSDF, and generates, based on the image data to be encoded, reference encoded image data encoded with the reference digital code values. The mapping operation, from the luminance values based on the image data to be encoded to the reference digital code values, may include selecting reference digital code values whose corresponding reference gray levels (e.g., as shown in TABLE 1) match, or approximate as closely as any other reference luminance values in the reference GSDF, the luminance values as specified by or determined from the image data to be encoded and replacing the luminance values with the reference digital code values in the reference encoded image data.
    Additionally, optionally or alternatively, preprocessing and post processing steps (which may include, but are not limited only to, color space conversion, down sampling, upsampling, tone mapping, color grading, decompression, compression, etc.) may be performed as a part of generating the reference encoded image data.
    In an example embodiment, the framework (500) may comprise software and/or hardware components (e.g., an encode or format unit (506)) configured to encode and/or format the reference encoded image data into one or more coded bitstreams or image files. The coded bitstreams or image files may be in a standard-based format, a proprietary format, or an extension format based at least in part on a standard-based format. Additionally and/or optionally, the coded bitstreams or image files may comprise metadata containing one or more of related parameters (e.g., model parameters; minimum luminance value, maximum luminance value, minimum digital code value, maximum digital code value, etc., as illustrated in TABLE 1, FIG. 3  and FIG. 4 ; an identifying field that identifies a CSF among a plurality of CSFs; reference viewing distance) related to the reference GSDF, pre-processing or post processing used to generate the reference encoded image data.
    In some embodiments, the framework (500) may comprise one or more discrete upstream devices. For example, at least one of the one or more upstream devices in the framework (500) may be configured to encode image data based on the reference GSDF. The upstream devices may comprise software and/or hardware components configured to perform the functionality related to 502, 504, and 506, of FIG. 5 . The coded bitstreams or image files may be outputted by the upstream devices (502, 504, and 506, of FIG. 5 ) through network connections, digital interfaces, tangible storage media, etc., and delivered in an image data flow (508) to other image processing devices for processing or rendering.
    In some example embodiments, the framework (500) further comprises one or more downstream devices as one or more discrete devices. The downstream devices may be configured to receive/access, from the image data flow (508), the coded bitstreams or image files outputted by the one or more upstream devices. For example, the downstream devices may comprise software and/or hardware components (e.g., a decode or reformat unit (510)) configured to decode and/or reformat the coded bitstreams and image files, and recover/retrieve the reference encoded image data therein. As illustrated in FIG. 5 , the downstream devices may comprise a diverse set of display devices.
    In some embodiments, a display device (not shown) may be designed and/or implemented to support the reference GSDF. High-precision HDR image rendering may be provided if the display device supports each and every gray level in the reference GSDF. The display device may render images at details at a finer level than, or at the same level as, what human vision may possibly detect.
    In some embodiments, a display device's native digital code values (which may be implemented as digitized voltage values, e.g., digital drive levels or DDLs, in the display system) in a device-specific GSDF may correspond to device-specific gray levels (or luminance values) different from those in the reference GSDF. The device-specific gray levels may be designed to support sRGB, Rec. 709, or other specifications including those using representations related to complementary densities. Additionally, optionally, or alternatively, the device-specific gray levels may be based on the essential DAC characteristics of display driving.
    In some embodiments, a display device A (512-A) may be designed and/or implemented to support a device-specific GSDF A (514-A) of a visible dynamic range (VDR) display. GSDF A (514-A) may be based on a bit depth of 12 bits (a 12 bit code space) for device-specific digital code values, a 10,000:1 contrast ratio (CR), and a >P3 gamut. GSDF A (514-A) may support gray levels within a first sub-range (e.g., 0 to 5,000 cd/m2) in the entire range of the reference GSDF (504). Alternatively and/or optionally, GSDF A (514-A) may support the entire range (0 to 12,000 cd/m2, for example) in the reference GSDF (504) but may comprise fewer than all the reference gray levels in the reference GSDF (504).
    In some embodiments, a display device B (512-B) may be designed and/or implemented to support a device-specific GSDF B (514-B) for a dynamic range narrower than the VDR. For example, display device B (512-B) may be a standard dynamic range (SDR) display. As used herein, the terms "standard dynamic range" and "low dynamic range," and/or their corresponding abbreviations "SDR" and "LDR" may be used synonymously and/or interchangeably. In some embodiments, GSDF B (514-B) may support a bit depth of 8 bits for device-specific digital code values, a 500-5,000:1 contrast ratio (CR), and a color gamut as defined in Rec. 709. In some embodiments, GSDF B (514-B) may provide gray levels within a second sub-range (e.g., 0 to 2000 cd/m2) of the reference GSDF (504).
    In some embodiments, a display device C (512-C) may be designed and/or implemented to support a device-specific GSDF C (514-C) for a dynamic range even narrower than the SDR. For example, display device C (512-C) may be a tablet display. In some embodiments, GSDF C (514-C) may support a bit depth of 8 bits for device-specific digital code values, a 100-800:1 contrast ratio (CR), and a color gamut smaller than that defined in Rec. 709. In some embodiments, GSDF C (514-C) may support gray levels within a third sub-range (e.g., 0 to 1,200 cd/m2) of the reference GSDF (504).
    In some embodiments, a display device (e.g., display device D (512-D)) may be designed and/or implemented to supports a device-specific GSDF (e.g., GSDF D (514-D)) for a very limited dynamic range much narrower than the SDR. For example, display device D (512-D) may comprise an e-paper display. In some embodiments, GSDF D (514-D), may support a bit depth of 6 bits or less for device-specific digital code values; a contrast ratio (CR) of 10:1 or less, and a color gamut much smaller than that defined in Rec. 709. In some embodiments, GSDF D (514-D) may support gray levels within a fourth sub-range (e.g., 0 to 100 cd/m2) of the reference GSDF (504).
    Precision in image rendering may be gracefully scaled down with each of display devices A through D (512-A through -D). In some embodiments, the subset of gray levels in each of device specific GSDF A through D (514-A through -D) may be correlated with, or mapped to, supported reference gray levels in the reference GSDF (504) in such a way as to evenly distribute perceptually noticeable errors in the range of gray levels supported by that display device.
    In some embodiments, a display device (e.g., one of 512-A through -D) with a device-specific GSDF (e.g., one of 514-A through -D) receives/extracts reference encoded image data encoded based on a reference GSDF. In response, the display device, or a conversion unit (one of 516-A through -D) therein, maps reference digital code values as specified in the reference encoded image data, to device-specific digital code values that are native to the display device. This may be performed in one of several ways. In an example, mapping from the reference digital code values to the device-specific digital code values includes selecting device-specific gray levels (corresponding to the device-specific digital code values) that match, or approximate as closely as any other device-specific gray levels, the reference gray levels (corresponding to the reference digital code values). In another example, mapping from the reference digital code values to the device-specific digital code values includes (1) determining tone-mapped luminance values based on the reference gray levels (corresponding to the reference digital code values) associated with the reference GSDF, and (2) selecting device-specific gray levels (corresponding to the device-specific digital code values) that match, or approximate as closely as any other device-specific gray levels, the tone-mapped luminance values.
    Subsequently, the display device, or a driver chip (one of 518-A through -D) therein, may use the display-specific digital code values to render images with device-specific gray levels that correspond to the display-specific code values.
    Generally speaking, a reference GSDF may be based on a different CSF model than that on which a display-specific GSDF is based. Conversion/mapping between the reference GSDF and the device-specific GSDF is necessary. Even if the same CSF model is used to generate both the reference GSDF and a device-specific GSDF, different values of model parameters may be used in deriving the GSDFs. For the reference GSDF, model parameter values may be conservatively set to preserve details for a wide variety of downstream devices, while for the device-specific GSDF, model parameter values may reflect specific design/implementation and viewing conditions under which the display device is to render images. Conversion/mapping between the reference GSDF and the device-specific GSDF is still necessary, as the specific display device's viewing condition parameters (e.g., the ambient light level, the display device's optical reflectivity, etc.) are different from the model parameter values used to derive the reference GSDF. Here, the viewing condition parameters may include those that impinge display quality (e.g., contrast ratio, etc.) and elevate the black level (e.g., the lowest gray level, etc.). Conversion/mapping between the reference GSDF and the device-specific GSDF under techniques as described herein improves quality in image rendering (e.g., improve the contrast ratio by increasing luminance values at high value regions, etc.).
    Based on the definition data received, the conversion unit (516) cascades the reference GSDF with display-specific GSDF to form a conversion lookup table (Conversion LUT). Cascading between the two GSDFs may include comparing gray levels in the two GSDFs, and based on the results of comparing gray levels, establishing a mapping between reference digital code values in the reference GSDF and display-specific digital code values in the display-specific GSDF.
    More specifically, given a reference digital code value in the reference GSDF, its corresponding reference gray level maybe determined based on the reference GSDF. The reference gray level so determined may be used to locate a device-specific gray level in the display-specific GSDF. In an example embodiment, the device-specific gray level located may match, or approximate as closely as any other display-specific gray levels in the display-specific GSDF, the reference gray level. In another example embodiment, a tone-mapped luminance value may be obtained by a global or local tone-mapping operator acting on the reference gray level; the device-specific gray level located may match, or approximate as closely as any other display-specific gray levels in the display-specific GSDF, the tone-mapped luminance value.
    With the device-specific gray level, a corresponding display-specific digital code value may be identified from the display-specific GSDF. An entry may be added or defined in the conversion LUT, consisting of the reference digital code value and the display-specific code value.
    The steps as described above may be repeated for other reference digital code values in the reference GSDF.
    In some embodiments, a conversion LUT may be pre-built and stored before image data whose processing is to be done based in part on the conversion LUT is received and processed. In alternative embodiments, image data that is to be processed with a conversion LUT is analyzed. The results of the analysis may be used to set up or at least adjust correspondence relationships between the reference digital code values and the device-specific digital code values. For example, if the image data indicates a particular concentration or distribution of luminance values, the conversion LUT may be set up in a way to preserve a large amount of details in the concentrated region of luminance values.
    In some embodiments, the conversion unit (516) comprises one or more software and/or hardware components (a comparison sub-unit (602)) configured to compare quantization steps (e.g., luminance value differences, or ΔLs, between adjacent digital code values) in both the reference GSDF and display-specific GSDF (514-A). For example, the quantization step at a reference digital code value in the reference GSDF may be a reference luminance value difference (reference GSDF ΔL), while the quantization step at a display-specific digital code value in the display-specific GSDF may be a display-specific luminance value difference (display-specific GSDF ΔL). Here, the display-specific digital code value corresponds to (or forms a pair in the conversion LUT with) the reference digital code value. In some embodiments, the comparison sub-unit (602) compares these two luminance value differences. This operation is essentially a test which may be performed either based on ΔL values, or optionally and/or alternatively, based on the relative slopes of the two GSDF curves.
    Quantization steps for luminance values in the display-specific GSDF may typically exceed those of the reference GSDF, as one or more reference gray levels from the reference GSDF (e.g., corresponding to a high bit-depth domain, etc.) are merged into display-specific gray levels from the display-specific GSDF (e.g., corresponding to a low bit-depth domain, etc.). In these cases, dithering is used to remove banding artifacts. As part of overall dithering, dithering is also performed on local surrounding output pixels (in space and/or in time). In a sense, the human eye may be represented as a low-pass filter. At least in this sense, averaging local surrounding pixels as described herein thus creates desired output gray levels that reduce and/or remove banding visual artifacts, which otherwise could be present due to large quantization steps in the display-specific GSDF.
    In less common cases, quantization steps for luminance values for the reference GSDF may occasionally exceed those of the display-specific GSDF. A decontouring algorithm-based process is used, synthesizing an output gray level based on an input gray level, for example, by averaging neighboring input pixels.
    Correspondingly, if the reference GSDF ΔL is greater than the display-specific GSDF ΔL, which is the "Y" path in FIG. 6 , then a decontour algorithm flag is set for an entry, in the conversion LUT, that comprises the reference digital code value and the display-specific digital code value.
    If the reference GSDF ΔL is less than the display-specific GSDF ΔL, which is the "N" path in FIG. 6 , then a dither algorithm flag is set for an entry, in the conversion LUT, that comprises the reference digital code value and the display-specific digital code value.
    If the reference GSDF ΔL is equal to the display-specific GSDF ΔL, then neither a decontour algorithm flag nor a dither algorithm flag is set for an entry, in the conversion LUT, that comprises the reference digital code value and the display-specific digital code value.
    Decontour and dither algorithm flags may be stored with entries in the conversion LUT, or may be stored in a related data structure outside, but operatively linked with, the conversion LUT.
    In some embodiments, the conversion unit (516) is configured to receive reference encoded image data, which may be in the form of high bit-depth or floating point input image, and to map reference digital code values specified in the reference GSDF to display-specific digital code values specified in the display-specific GSDF. In addition to mapping digital code values between the GSDFs, the conversion unit (516) maybe configured to perform decontouring or dithering based on the settings of algorithm flags (decontour algorithm flags or dithering algorithm flags) previously discussed.
    As noted, the reference GSDF likely contains a greater amount of details than a display-specific GSDF; thus, the "Y" path of FIG. 6  may not occur, or may occur less often. In some embodiments, the "Y" path and related processing may be omitted to simplify the implementation of a conversion unit.
    In some embodiments, given a reference digital code value as determined for a pixel in the reference encoded image data, the conversion unit (516) looks up in the conversion LUT for a corresponding display-specific digital code value, and replaces the reference digital code value with the corresponding display-specific digital code value. Additionally and/or optionally, the conversion unit (516) determines whether a decontour or dithering algorithm should be performed for the pixel, based on the existence/setting of an algorithm flag for an entry, in the conversion LUT, that comprises the reference digital code value and the display-specific digital code value.
    If it is determined that neither a decontour algorithm nor a dithering algorithm should be performed (e.g., no indication or flag for performing either algorithm), then no decontour or dithering is performed for the pixel for the time being.
    If it is determined that a decontour algorithm should be performed, then the conversion unit (516) may perform one or more decontour algorithms (Decontour Algo). Performing the one or more decontour algorithms may include receiving image data of input local neighborhood pixels and inputting the image data of the local neighborhood pixels to the decontour algorithms.
    If it is determined that a dithering algorithm should be performed, then the conversion unit (516) may perform one or more dithering algorithms (Dithering Algo).
    The pixel may still be involved in decontour or dithering if the conversion unit (516) determines that decontour or dithering needs to be performed with respect to neighborhood pixels. In an example, the device-specific (output) gray level of the pixel may be used for dithering local neighborhood pixels. In another example, the reference (input) gray level of the pixel may be used for decontouring local neighborhood pixels.
    In some embodiments, the conversion unit (516) outputs the processing results of the foregoing steps to downstream processing units or sub-units. The processing results comprise display-specific encoded image data in the format of display-specific bit-depth output image encoded with digital code values in the display-specific GSDF (e.g., GSDF-A).
    Additionally and/or optionally, the SDR display (700) comprises a display management unit (704) that maintains display parameters for the SDR display (700). The display parameters may at least in part define a display-specific GSDF (e.g., GSDF-B of FIG. 5 ) associated with the SDR display (700). The display parameters defining the display-specific GSDF may include maximum (max) and minimum (min) gray levels supported by the SDR display (700). The display parameters may also include color primaries (primaries) supported by the SDR display, display size (size), optical reflectivity of the SDR display's image rendering surface, ambient light level. Some of the display parameters may be preconfigured with fixed values. Some of the display parameters may be measured in real-time or near real-time by the SDR display (700). Some of the display parameters may be configurable by a user of the SDR display (700). Some of the display parameters may be preconfigured with default values and may be overridden by measurement or by a user. The display management unit (704) establishes/shapes perceptual nonlinearity of the display-specific gray levels based on the reference GSDF, and may additionally and/or optionally perform tone mapping as a part of establishing/shaping the display-specific gray levels. For example, a conversion LUT as illustrated in FIG. 5  and/or other related meta data (e.g., dithering and decontour processing flags, etc.) may be established by the display management unit (704) for the purpose of establishing/shaping perceptual nonlinearity of the display-specific gray levels in accordance with the reference GSDF. Cascading operations as previously discussed may be implemented with the display management unit (704) to create the conversion LUT and/or other related metadata (712) relating to one or both of the reference GSDF and display-specific GSDF. The conversion LUT and/or other related metadata (712) may be accessed and used by other units or sub-units in the SDR display (700). Further, the conversion LUT and/or other related metadata may be used as, or to derive, metadata (714) for inverting perceptual nonlinearity. As used herein, inverting perceptual nonlinearity may include converting display-specific digital code values to display-specific digital driving levels (e.g., digitized voltage levels in the display device).
    Additionally and/or optionally, the SDR display (700) includes a conversion unit (516) as illustrated in FIG. 5  and FIG. 6 , and an 8 bit perceptual quantizer (706). In some embodiments, the SDR display (700), or the conversion unit (516) and the 8 bit perceptual quantizer (706) therein, converts the reference encoded image data into a display-specific bit-depth output image encoded with display-specific digital code values associated with the display-specific GSDF (e.g., GSDF-A or GSDF-B of FIG. 5 ), and quantizes the display-specific bit-depth output image into perceptually encoded image data in a 8 bit code space. As used herein, the term "perceptually encoded" may refer to a type of encoding that is based on a human visual perceptual model, such as a CSF that gives rise to the reference GSDF.
    Additionally and/or optionally, the SDR display (700) comprises a video post-processing unit (708) that may, but is not limited only to, perform zero, one, or more of image processing operations on the perceptually encoded image data in an 8 bit luminance representation. These image processing operations may include, but are not limited only to, compression, decompression, color space conversion, downsampling, upsampling, or color grading. The results of these operations may be outputted to other parts of the SDR display (700).
    In an example embodiment, the SDR display (700) comprises an 8 bit inverse perceptual quantizer (710) configured to convert display-specific digital code values in the results of image processing operations to display-specific digital driving levels (e.g., digitized voltage levels). The display-specific digital driving levels generated (or converted back from digital code values) by the inverse perceptual quantizer (710) may specifically support one of several types of luminance nonlinearities supportable in the SDR display (700). In an example, the inverse perceptual quantizer (710) converts display-specific digital code values to display-specific digital driving levels to support luminance nonlinearities associated with Rec. 709. In another example, the inverse perceptual quantizer (710) converts display-specific digital code values to display-specific digital driving levels to support luminance nonlinearities associated with a linear luminance domain or a log luminance domain (which may be relatively easy to be integrated with local dimming operations). In another example, the inverse perceptual quantizer (710) converts display-specific digital code values to display-specific digital driving levels to support a display-specific CSF (or its associated GSDF), with optimal placement of display-specific gray levels for the specific display (700), and possibly adjusted for the viewing conditions specific to the display (700).
    In block 804, the computing device encodes, based on a reference mapping between a set of reference digital code values and a set of reference gray levels, the image data to be encoded into reference encoded image data. Here, luminance values in the image data to be encoded are represented by the set of reference digital code values. A luminance difference between two reference gray levels represented by two adjacent reference digital code values in the set of reference digital code values may be inversely proportional to a peak contrast sensitivity of human vision adapted at a particular light level.
    In block 806, the computing device outputs the reference encoded image data.
    In an embodiment, the computing device determines a reference gray scale display function (GSDF) based on a contrast sensitivity function (CSF) model; the reference GSDF specifies the reference mapping between the set of reference digital code values and the set of reference gray levels. The CSF model comprises one or more model parameters, which may have an angular size that falls in a range comprising one or more of: between 25 degrees x 25 degrees and 30 degrees x 30 degrees, between 30 degrees x 30 degrees and 35 degrees x 35 degrees, between 35 degrees x 35 degrees and 40 degrees x 40 degrees, between 40 degrees x 40 degrees and 45 degrees x 45 degrees, or greater than 45 degrees x 45 degrees.
    In an embodiment, the computing device assigns an intermediate luminance value, within a range of luminance values supported by the set of reference gray levels, to an intermediate digital code value in a code space that hosts the set of reference digital code values, and derives, by performing one or more of stacking or integration computations, a plurality of sub-mappings, each sub-mapping maps a reference digital code value in the set of reference digital code values to a reference gray level in the set of reference gray levels. The intermediate luminance value may be selected within a range comprising one or more of: less than 50 nits, between 50 nits and 100 nits inclusive, between 100 and 500 nits inclusive, or no less than 500 nits.
    In an example embodiment, the set of reference gray levels covers a dynamic range with an upper limit having a value of: less than 500 nits, between 500 nits and 1000 nits inclusive, between 1000 and 5000 nits inclusive, between 5000 nits and 10000 nits inclusive, between 10000 nits and 15000 nits inclusive, or greater than 15000 nits.
    In an embodiment, the peak contrast sensitivity is determined from a contrast sensitivity curve among a plurality of contrast sensitivity curves determined based on a contrast sensitivity function (CSF) model having model parameters comprising one or more of a luminance value variable, a spatial frequency variable, or one or more other variables.
    In an embodiment, at least two peak contrast sensitivities determined based on at least two contrast sensitivity curves in the plurality of contrast sensitivity curves occur at two different spatial frequency values.
    In an embodiment, the computing device converts one or more input images represented, received, transmitted, or stored with the image data to be encoded from an input video signal into one or more output images represented, received, transmitted, or stored with the reference encoded image data contained in an output video signal.
    In an embodiment, the image data to be encoded comprises image data encoded in one of a high-resolution high dynamic range (HDR) image format, a RGB color spaces associated with the Academy Color Encoding Specification (ACES) standard of the Academy of Motion Picture Arts and Sciences (AMPAS), a P3 color space standard of the Digital Cinema Initiative, a Reference Input Medium Metric/Reference Output Medium Metric (RIMM/ROMM) standard, an sRGB color space, a RGB color space associated with the BT.709 Recommendation standard of the International Telecommunications Union (ITU), etc.
    In an embodiment, the luminance difference between the two reference gray levels represented by the two adjacent reference digital code values is less than a just noticeable difference threshold at the particular light level.
    In an embodiment, the particular light level is a luminance value between the two luminance values, inclusive.
    In an embodiment, the set of reference digital code values comprises integer values in a code space with a bit depth of: less than 12 bits; between 12 bits and 14 bits, inclusive; at least 14 bits; 14 bits or more.
    In an embodiment, the set of reference gray levels may comprise a set of quantized luminance values.
    In block 854, the computing device receives reference encoded image data encoded with the set of reference digital code values. Luminance values in the reference encoded image data are based on the set of reference digital code values. A luminance difference between two reference gray levels represented by two adjacent reference digital code values in the set of reference digital code values may be inversely proportional to a peak contrast sensitivity of human vision adapted at a particular light level.
    In block 856, the computing device transcodes, based on the digital code mapping, the reference encoded image data encoded with the set of reference digital code values into device-specific image data encoded with the set of device-specific digital control codes. Luminance values in the device-specific image data are based on the set of device-specific digital code values.
    In an embodiment, the computing device determines a set of correspondence relationships between the set of reference digital code values and the set of device-specific digital code values. Here, a correspondence relationship in the set of correspondence relationship relates a reference digital code value in the set of reference digital code values to a device-specific digital code value. The computing device further compares a first luminance difference at the reference digital code value and a second luminance difference at the device-specific digital code value, and stores, based on comparing the first luminance difference and the second luminance difference, an algorithm flag as to whether dithering, decontouring, or no operation should be performed for the reference digital code value.
    In an embodiment, the computing device determines a reference digital code value from the reference encoded image data for a pixel, and further determines whether an algorithm flag is set for the reference digital code value. In response to determining that an algorithm flag is set for decontour, the computing device performs a decontour algorithm on the pixel. Alternatively, in response to determining that an algorithm flag is set for dithering, the computing device performs a dithering algorithm on the pixel.
    In an embodiment, the computing device renders one or more images on a display based on the device-specific image data encoded with the set of device-specific digital control codes. Here, the display may be, but is not limited only to, one of a visible dynamic range (VDR) display, a standard dynamic range (SDR) display, a tablet computer display, or a handheld device display.
    In an embodiment, a device-specific gray scale display function (GSDF) specifies the device-specific mapping between the set of device-specific digital code values and the set of device-specific gray levels.
    In an embodiment, the device-specific mapping is derived based on one or more display parameters and zero or more viewing condition parameters.
    In an embodiment, the set of device-specific gray levels covers a dynamic range with an upper limit having a value of: less than 100 nits, no less than 100 nits but less than 500 nits, between 500 nits and 1000 nits inclusive, between 1000 and 5000 nits inclusive, between 5000 nits and 10000 nits inclusive, or greater than 10000 nits.
    In an embodiment, the computing device converts one or more input images represented, received, transmitted, or stored with the reference encoded image data from an input video signal into one or more output images represented, received, transmitted, or stored with the device-specific image data contained in an output video signal.
    In an embodiment, the device-specific image data supports image rendering in one of a high-resolution high dynamic range (HDR) image format, a RGB color spaces associated with the Academy Color Encoding Specification (ACES) standard of the Academy of Motion Picture Arts and Sciences (AMPAS), a P3 color space standard of the Digital Cinema Initiative, a Reference Input Medium Metric/Reference Output Medium Metric (RIMM/ROMM) standard, an sRGB color space, or a RGB color space associated with the BT.709 Recommendation standard of the International Telecommunications Union (ITU).
    In an embodiment, the set of device-specific digital code values comprises integer values in a code space with a bit depth of: 8 bits; greater than 8 but less than 12 bits; 12 bits or more.
    In an embodiment, the set of device-specific gray levels may comprise a set of quantized luminance values.
    In various embodiments, an encoder, a decoder, a system, etc., performs any or a part of the foregoing methods as described.
    According to one embodiment, the techniques described herein are implemented by one or more special-purpose computing devices. The special-purpose computing devices may be hard-wired to perform the techniques, or may include digital electronic devices such as one or more application-specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs) that are persistently programmed to perform the techniques, or may include one or more general purpose hardware processors programmed to perform the techniques pursuant to program instructions in firmware, memory, other storage, or a combination. Such special-purpose computing devices may also combine custom hard-wired logic, ASICs, or FPGAs with custom programming to accomplish the techniques. The special-purpose computing devices may be desktop computer systems, portable computer systems, handheld devices, networking devices or any other device that incorporates hard-wired and/or program logic to implement the techniques.
    For example, FIG. 9  is a block diagram that illustrates a computer system 900 upon which an example embodiment of the invention may be implemented. Computer system 900 includes a bus 902 or other communication mechanism for communicating information, and a hardware processor 904 coupled with bus 902 for processing information. Hardware processor 904 may be, for example, a general purpose microprocessor.
    Computer system 900 also includes a main memory 906, such as a random access memory (RAM) or other dynamic storage device, coupled to bus 902 for storing information and instructions to be executed by processor 904. Main memory 906 also may be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 904. Such instructions, when stored in non-transitory storage media accessible to processor 904, render computer system 900 into a special-purpose machine that is customized to perform the operations specified in the instructions.
    Computer system 900 further includes a read only memory (ROM) 908 or other static storage device coupled to bus 902 for storing static information and instructions for processor 904. A storage device 910, such as a magnetic disk or optical disk, is provided and coupled to bus 902 for storing information and instructions.
    Computer system 900 may be coupled via bus 902 to a display 912, such as a liquid crystal display, for displaying information to a computer user. An input device 914, including alphanumeric and other keys, is coupled to bus 902 for communicating information and command selections to processor 904. Another type of user input device is cursor control 916, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 904 and for controlling cursor movement on display 912. This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify positions in a plane.
    Computer system 900 may implement the techniques described herein using customized hard-wired logic, one or more ASICs or FPGAs, firmware and/or program logic which in combination with the computer system causes or programs computer system 900 to be a special-purpose machine. According to one embodiment, the techniques herein are performed by computer system 900 in response to processor 904 executing one or more sequences of one or more instructions contained in main memory 906. Such instructions may be read into main memory 906 from another storage medium, such as storage device 910. Execution of the sequences of instructions contained in main memory 906 causes processor 904 to perform the process steps described herein. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions.
    The term "storage media" as used herein refers to any non-transitory media that store data and/or instructions that cause a machine to operation in a specific fashion. Such storage media may comprise non-volatile media and/or volatile media. Non-volatile media includes, for example, optical or magnetic disks, such as storage device 910. Volatile media includes dynamic memory, such as main memory 906. Common forms of storage media include, for example, a floppy disk, a flexible disk, hard disk, solid state drive, magnetic tape, or any other magnetic data storage medium, a CD-ROM, any other optical data storage medium, any physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, NVRAM, any other memory chip or cartridge.
    Storage media is distinct from but may be used in conjunction with transmission media. Transmission media participates in transferring information between storage media. For example, transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise bus 902. Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.
    Various forms of media may be involved in carrying one or more sequences of one or more instructions to processor 904 for execution. For example, the instructions may initially be carried on a magnetic disk or solid state drive of a remote computer. The remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem. A modem local to computer system 900 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal. An infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry can place the data on bus 902. Bus 902 carries the data to main memory 906, from which processor 904 retrieves and executes the instructions. The instructions received by main memory 906 may optionally be stored on storage device 910 either before or after execution by processor 904.
    Computer system 900 also includes a communication interface 918 coupled to bus 902. Communication interface 918 provides a two-way data communication coupling to a network link 920 that is connected to a local network 922. For example, communication interface 918 may be an integrated services digital network (ISDN) card, cable modem, satellite modem, or a modem to provide a data communication connection to a corresponding type of telephone line. As another example, communication interface 918 may be a local area network (LAN) card to provide a data communication connection to a compatible LAN. Wireless links may also be implemented. In any such implementation, communication interface 918 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.
    Network link 920 typically provides data communication through one or more networks to other data devices. For example, network link 920 may provide a connection through local network 922 to a host computer 924 or to data equipment operated by an Internet Service Provider (ISP) 926. ISP 926 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the "Internet" 928. Local network 922 and Internet 928 both use electrical, electromagnetic or optical signals that carry digital data streams. The signals through the various networks and the signals on network link 920 and through communication interface 918, which carry the digital data to and from computer system 900, are example forms of transmission media.
    Computer system 900 can send messages and receive data, including program code, through the network(s), network link 920 and communication interface 918. In the Internet example, a server 930 might transmit a requested code for an application program through Internet 928, ISP 926, local network 922 and communication interface 918.
    The received code may be executed by processor 904 as it is received, and/or stored in storage device 910, or other non-volatile storage for later execution.
    The following Table 3 describes the calculation of the Perceptual Curve EOTF for converting digital video code values into absolute linear luminance levels at the point of display. Also included is the inverse OETF calculation for converting absolute linear luminance into digital code values.
 Table 3. Exemplary Specification for Perceptual Curve EOTF 
     
    | Notes: | |
| 1. The operator INT returns the value of 0 for fractional parts in the range of 0 to 0.4999... and +1 for fractional parts in the range of 0.5 to 0.9999..., i.e. it rounds up fractions above 0.5. | |
| 2. All constants are defined as exact multiples of 12 bit rationals to avoid rounding concerns. | |
| 3. R, G, or B signal components are to be computed in the same way as the Y signal component described above. | |
The following Table 4 shows exemplary values for 10 bits.
  Claims (6)
-  A method comprising:- receiving, by a data decoder, encoded image data; and- decoding, by the data decoder, the encoded image data, characterized in that said decoding comprises mapping digital code values in the encoded image data to normalized luminance values based at least in part on a functional model of:-  The method of claim 1, wherein a relationship between the normalized value V and the corresponding digital code value D is provided by the function:-  The method of claim 2, wherein the bit depth b is 10 bits.
-  The method of claim 2, wherein the bit depth b is 12 bits.
-  The method of claim 1, wherein the digital code value D represents a sampled value of a component of the image data in a serial digital interface (SDI) signal.
-  A bitstream comprising image data, characterized in that the image data is encoded with digital code values D that represent normalized luminance Y based at least in part on an inverse of a functional model ofwherein V is a normalized value of the digital code value D, with 0 ≤ Y ≤ 1 and 0 ≤ V ≤ 1;wherein n, m, c1 , c2 , and c3 are predetermined values, with:
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| US201161567579P | 2011-12-06 | ||
| US201261674503P | 2012-07-23 | ||
| US201261703449P | 2012-09-20 | 
Publications (2)
| Publication Number | Publication Date | 
|---|---|
| HK40010382A HK40010382A (en) | 2020-07-03 | 
| HK40010382B true HK40010382B (en) | 2021-04-09 | 
Family
ID=
Similar Documents
| Publication | Publication Date | Title | 
|---|---|---|
| AU2023204114B2 (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK40112223A (en) | Method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK40112223B (en) | Method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK40049320A (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK40049320B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK40049879B (en) | Computer program for and computer readable medium having stored a bitstream for improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK40049879A (en) | Computer program for and computer readable medium having stored a bitstream for improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK40010382A (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK40010382B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1229922B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1229922A1 (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities | |
| HK1197471B (en) | Device and method of improving the perceptual luminance nonlinearity-based image data exchange across different display capabilities |