Title of Invention

A DEVICE FOR CAPTURING AND VIEWING DATA

Abstract A device (4) for capturing and processing viewing data, which viewing data relate to the viewing behavior of users when viewing video data, which video data are reproduced by means of a display device (41) of the device (4), the device (4) including a feedback module (42) which feedback module (42) transmits the viewing data to an evaluation unit (2, 44) wherein the display device (41) is a virtual retinal display device which projects the video data directly on the retina (51) of the user, the virtual retinal display device (41) includes an eye position detection module (411), which, during projection of the video data, determines data on lines of sight of the user relative to the viewed video information by determining current eye positions of the user; and the feedback module (42) is set up such that it transmits the viewing data to the evaluation unit (2, 44) at least with the data on the lines of sight.
Full Text FORM 2
THE PATENTS ACT 1970
[39 OF 1970]
&
THE PATENTS RULES, 2003
COMPLETE SPECIFICATION
[See Section 10; rule 13]
"A DEVICE FOR CAPTURING AND VIEWING DATA"
SWISSCOM MOBILE AG, of Schwarztorstrasse 61, 3050 Bern, Switzerland,
The following specification particularly describes the invention and the manner in which it is to be performed:

Method, Device for Capturing Data about the Viewing of Video Data, and the Passing On of these Data to a Central Data Processing Facility
This invention relates to a method and a device for capturing and processing of viewing data. In particular, this invention relates to a method and 5 a device for capturing and processing of viewing data that concern the viewing behavior of users when viewing video data.
The viewing behavior of television viewers is statistically recorded and evaluated today, particularly for marketing purposes, on the one hand to find out which programs, or respectively which channels, are watched by whom and
10 how often, and on the other hand to obtain a qualitative assessment from registered television viewers of the program watched. A mobile data logging device which, for purposes of market research, monitors and logs the use of a television set by a user has been described in the patent application WO 94/15417 A. Conventional data logging devices and methods are not suitable,
15 however, for capturing the viewing behavior of users on the picture level, i.e. to record statistically the viewing behavior of individuals and/or groups when viewing concrete moving and/or still video data.
Described in the patent application WO 90/02453 is a system and a method for monitoring television viewers. According to WO 90/02453, light
20 beams that are reflected by the eyes of a viewer, located within a defined visual range in front of a television set, are registered by means of a suitable receiver, which is positioned on the television set, for example. In the receiver it is determined on the basis of the registered reflected light beams whether the respective viewer is looking at the television screen. According to WO
25 90/02453, data about the viewing time and the selected television channel are stored and are transmitted via the telephone network to a central unit. The system according to WO 90/02453 is limited to the registration of a viewer within a defined visual range, and can only detect whether the screen is being looked at by the viewer; thus no indications about picture regions or picture
30 objects, looked at by the viewer on the television screen, are possible.
It is an object of this invention to propose a new and better method, a new and better device as well as a new and better system for capturing and processing viewing data that make it possible to capture the viewing behavior


of users when viewing video data.
This object is achieved according to the invention in particular through the elements of the independent claims. Further advantageous embodiments follow moreover from the dependent claims and from the description.
5 This object is achieved through the present invention in particular in that
when viewing video data, for example still or moving pictures from transmitted television programs, reproduced, stored video sequences, pictures or graphics, data about lines of sight of a user relative to the viewed video data are determined in that the video data are projected through a virtual retinal display
10 device directly onto the retina of the user and current eye positions of the user are determined, and the viewing data, which contain at least these data about lines of sight, are transmitted to an evaluation unit, for instance to a central unit via a telecommunications network. This has the advantage that lines of sight of a user relative to viewed video data can be determined without it being thereby
15 necessary to also take into account horizontal or vertical head movements of the user. Made possible moreover is for viewing data about users' viewing habits when watching video data to be captured centrally, in particular data about which picture segments of reproduced video data are looked at. The recorded viewing data are then available in the central unit for further
20 evaluation; they can also be used, however, for starting and/or controlling interactive processes, particularly on the level of individual users, such as dialogues for surveys or product orders, for instance.
In a preferred embodiment variant, current eye positions are compared with predefined values, for instance in the device with the display or in the
25 central unit, and, on the basis of the result of this comparison, predefined
actions are triggered, for instance in the device with the display or in the central unit, e.g. order procedures or the transmission of information, in particular video data. This has the advantage that graphic user interfaces can thereby be achieved which can be controlled by the user, without using his hands, through
30 positioning of his eyes.
In an embodiment variant, the viewing data transmitted to the central unit include user identification data, which originate for instance from identification


modules, e.g. from SIM cards (Subscriber Identification Module), that are each assigned to the users. This makes it possible for evaluation and use of captured viewing data to be carried out on the level of individual users, as described above, or for additional known information about respective users to 5 be taken into consideration in the evaluation and further use of captured viewing data.
In an embodiment variant, the viewing data transmitted to the central unit include video identification data. This is especially advantageous when the source of the video data and the central unit for capturing the viewing data are 10 not implemented together, so that captured viewing data can be associated with the respective video data during their evaluation and further processing.
In an embodiment variant, the viewing data transmitted to the central unit include time indications. Depending upon the type of video data, for instance in the case of transmission of video data by television program, time indications 15 can be used to assign captured viewing data to the respective video data, for their evaluation and further use.


The captured and transmitted viewing data are preferably stored in the central unit, for example in a viewing database, whereby the viewing data can also be made available at later points in time, in particular for statistical evaluations.
In an embodiment variant, the above-mentioned telecommunications network is a mobile radio network, for example a GSM or UMTS network or another, for instance satellite-based, mobile radio network. This has the advantage that the capturing of individual viewing data when viewing video data can be carried out in a mobile way, independently of fixed network connections.
An embodiment of the present invention will be described in the following with reference to an example. The example of the embodiment is illustrated by the following sole attached figure:
Figure 1 shows a block diagram of the system, which block diagram presents schematically a central unit that is connected, via a telecommunications network, to a device, in particular a communications terminal, which communications terminal comprises a video display device that projects video data onto the retina of an eye and which includes an eye position detection module that determines current eye positions of a user.
Reference numeral 4 in Figure 1 refers to a device, in particular a communications terminal, for example a fixed-installed communications terminal 4, e.g. a telephone or a communication-capable personal computer that is able to exchange data with a central unit 2, over a fixed network 3, for example a public switched telephone network, an ISDN network (Integrated Services Digital Network), an IP-based network (Internet Protocol), or a WAN (Wide Area Network) or LAN (Local Area Network), or a mobile communications terminal 4, i.e. a mobile device 4, for example a mobile radio telephone or a communication-capable laptop or palmtop computer, which is able to exchange data with a central unit 2 via a mobile radio network, for instance a GSM or UMTS network, or another, for instance satellite-based, mobile radio network, for example with the aid of SMS messages (Short Message Services), USSD messages (Unstructured Supplementary Services Data), GPRS services (Generalized Packet Radio Service), or according to another suitable protocol, via the user information channel.

The central unit 2 is based, for example, on a commercially available communications server having a communications module 21 with the necessary hardware and software components to communicate with the communications terminals 4 via the telecommunications network 3. The central unit 2 is directly connected to the telecommunications network 3, or is connected via suitable network elements, for instance a Mobile Switching Station (MSC), and includes a database 24 that is implemented on the same, or on a separate, computer.
As shown in Figure 1, the communications terminal 4 includes a video display device 41 which reproduces video data through projection of corresponding picture signals onto the retina 51 of the eye 5 of the user of the communications terminal 4. The video data are, for example, still or moving pictures of transmitted television programs or reproduced, stored video sequences, pictures, or graphics, that are obtained from, or respectively supplied by, the central unit 2 or another video source 6 connected to the communications terminal 4 via a video interface with contacts, for instance a television receiver, a video playback device, for example a video cassette recorder, or a reproduction device for digital video data stored on data carriers.
A video display device 41, which can project picture signals directly onto the retina 51 of a viewer, a so-called virtual retinal display device (Virtual Retinal Display, VRD) has been described in the patent applications WO 94/09472 and WO 97/37339. These virtual retinal display devices can be supplied with video data via a video interface, for instance in the form of an RGB signal, an NTSC signal, a VGA signal or another formatted color or monochrome video or graphics signal. One skilled in the art will understand that it can be advantageous to adapt the virtual retinal display device described in the mentioned patent publications WO 94/09472 and WO 97/37339, or the video interface described there, in such a way that it is also able to receive efficiently other formats of television signals and in particular digital video data. By means of an interface module (not shown), television signals and video data can also be suitably adapted to the video interface, or respectively obtained video data can be converted such that they are able to be applied to the video interface.


The video display device 41 and the further components of the communications terminal 41 can be implemented in a common or separate housings, the video display device 41 being connected in a first housing via a wired or via a wireless interface to components in the second housing, for instance.
As shown schematically in Figure 1, the video display device 41 includes an eye position tracking module 411, which determines current eye positions of the user when viewing video data and is able to transmit them, via the above-mentioned, or an additional, wired or wireless interface, to a feedback module 42 of the communications terminal 4. An eye position tracking module (eye tracker) which determines current eye positions based on the position of the pupil 52 of a user, has also been described in the above-mentioned patent application WO 94/09472, and can be extended by one skilled in the art such that the determined eye position is available for components outside the video display device 41 via a suitable interface; depending upon the embodiment, values for both eyes can be made available.
The feedback module 42 of the communications terminal 4, for example a programmed software module that is executed on a processor of the communications terminal 4, transmits determined current eye positions of the user, if applicable together with other viewing data, to an evaluation unit, for instance a programmed software module in the communications terminal 4, or in particular with the aid of communications services of the communications terminal 4, over the telecommunications network 3 to the central unit 2. In the central unit 2, the transmitted viewing data with the current eye positions are received by the communications module 21 and are sent to the processing module 23.
Depending upon the embodiment variant and application, the communications terminal 4 includes further modules 43,44, 45, 46 which contribute data to the viewing data.
The time determining module 43 determines the current time, and transmits the determined current time to the feedback module 42, from where it is transmitted to the central unit 2 together with the determined current eye positions in the viewing data. Besides establishing the point in time of the determined eye positions, the time indication can also be used to identify the

video data viewed at this point in time, for instance if the television channel watched at this point in time is known.
The input module 44 makes it possible for a user to enter user data and to transmit these data to the central unit 2, by means of the feedback module 42, together with the viewing data or separately. User data are, for example, qualitative data, e.g. a number from an evaluation scale or instructions or responses, which are transmitted to the central unit 2. The input module 44 includes, for example, operating elements and correspondingly programmed software functions which are able to receive user data entered by means of the operating elements. The input module, however, can also be a programmed software module which transmits determined current eye positions to the central unit 2 as user data, for instance at specified times or in response to predefined signals or instructions which are transmitted from the video source ( or the central unit 2 to the communications terminal 4, or, in the function of the above-mentioned evaluation unit, compares determined current eye positions with predefined position values or with position values that are transmitted from the video source 6 or the central unit 2 to the communications terminal 4, and, on the basis of this comparison, carries out operations corresponding to the position values, initiates actions, and/or transmits instructions, responses or evaluations as user data to the central unit 2. The comparison operation can also be carried out in the central unit 2, which will be explained more closely later. Such an input module 44 therefore makes it possible to use the virtual retinal display device 41, or respectively the communications terminal 4 with the virtual retinal display 41, as graphic user interface, which can be controlled by the user through positioning his eyes in that, by means of the virtual retinal display device, GUI objects (Graphical User Interface), corresponding to the position values, in the picture regions are projected onto the retina of the user. Corresponding video data for such a graphic user interface can also be transmitted, for instance, by the central unit 2 to the communications terminal 4.
The identification module 45, for example an SIM card (Subscriber Identification Module) contains user identification data, for example an IMSI (International Mobile Subscriber Identity) and/or a personal biometric code, or key, which can be transmitted to the central unit 2 by the feedback module 42 together with other viewing data. This is especially useful when viewing data are further processed or evaluated in the central unit 2 on an individual user


level, or when, in the central unit 2, additional user-specific data, for instance name and address information from a subscriber database, are brought in for further processing of the viewing data.
The video identification module 46, for example a programmed software module, determines video identification data for current video data, for instance the relevant television channel, the title of a video with the current sequence number of the current video frame or other indications, and passes on the determined video identification data to the feedback module 42 for transmission to the central unit 2 with other viewing data.
Through the processing module 23 of the central unit 2, for example a programmed software module, the received viewing data are evaluated and/or stored in a viewing database 24. An immediate evaluation of the received viewing data in the processing module 23 makes sense especially when predefined actions are supposed to be triggered on the basis of the current eye positions contained therein. For example, the communications terminal 4 with the virtual retinal display device 41 can be used, as mentioned above, as graphic user interface that is controlled by the user through eye positioning. In this way eye positions corresponding to a predefined picture region of the reproduced video data can trigger actions in the central unit 2. For example, a products and/or services ordering method can be initiated by the processing module 23, or information, in particular video data, can be transmitted via the telecommunications network 3 to the communications terminal 4 for reproduction via the display device 41, whereby in particular GUI applications of the client/server type can also be achieved. Stored viewing data can also be evaluated statistically, for instance at a later point in time. For example, which and how many viewers have viewed, or respectively have not viewed, particular picture regions of reproduced video data can be studied, which can be of interest for the evaluation of advertising films, for instance. In a further variant, in evaluating the eye positions, the processing module 23 can also take into account identified picture objects contained in the video data, so that the correlation of the eye positions with these identified objects can be studied. To carry out this last variant, it can be advantageous, for example, to analyze respective video data in advance with suitable image processing means such that their pictorial content can be described in abstract form, for instance through object designations, vectors and/or data on coordinates. Such abstract

content descriptions can be stored in the database 24, for instance together with the respective video data, and can be supplied to the processing module 23. Captured viewing data can also be stored, for instance user-specifically, as a user profile, and made further use of.
It should be explicitly stated here that, in the device 4, the virtual retinal display device 41 together with the input module 44 in the function of an evaluation unit can be used as GUI user interface without data having to be exchanged thereby with the central unit 2, which has the advantage that the device 4 can be controlled without use of other operating elements or the hands of a user, which can also be of interest in particular for non¬communication-capable computers.
Complete devices 4, as described, in particular communications terminals 4, can be sold or leased to an interested user. It can also be of commercial interest to sell expansion sets that include the necessary components to extend a conventional device, in particular a conventional communications terminal, into a described device 4, in particular a described communications terminal 4, which expansion sets also include in particular a data carrier with programmed feedback module 42, programmed input module 44, programmed video identification module 46 stored thereon and, if applicable, a time determining module 43. Whole systems can also be offered under license to interested operators, or data carriers can be sold to them containing a programmed communications module 21, processing module 23, and, if applicable, a viewing database 24 to operate a conventional communications server, which includes the hardware components needed by the communications module 21, as the described central unit 2.


List of Reference Numerals
1 system
2 center
3 telecommunications network (mobile radio network)
4 device (communications terminal, mobile device)
5 eye
6 video source
21 communications module
23 processing module
24 database (viewing database)

41 video display device (virtual retinal display device)
42 feedback module
43 time determining module
44 input module
45 identification module (SIM card)
46 video identification module

51 retina
52 pupil
411 eye position detection module

We claim:
1. A device (4) for capturing and processing viewing data, which viewing
data relate to the viewing behavior of users when viewing video data,
which video data are reproduced by means of a display device (41) of the
device (4), the device (4) including a feedback module (42) which
feedback module (42) transmits the viewing data to an evaluation unit (2,
44) wherein
the display device (41) is a virtual retinal display device which projects
the video data directly on the retina (51) of the user,
the virtual retinal display device (41) includes an eye position detection
module (411), which, during projection of the video data, determines data
on lines of sight of the user relative to the viewed video information by
determining current eye positions of the user; and
the feedback module (42) is set up such that it transmits the viewing
data to the evaluation unit (2, 44) at least with the data on the lines of
sight.
2. The device (4) as claimed in claim 1, wherein the feedback module (43) is set up such that it transmits the viewing data via a telecommunications network (3) to a central unit (2).
3. The device as claimed in one of the claims 1 or 2, wherein the device (4) includes means (44) of comparing the current eye positions with predefined values, and of triggering predefined actions on the basis of the result of this composition.
4. The device as claimed in one of the claims 1 to 3, wherein the device (4) includes an identification module (45) assigned to the user, with user identification data, and the viewing data include the user identification data.
5. The device (4) as claimed in one of the claims 1 to 4, wherein the device (4) includes a video identification module (46) which video identification


module (46) determines video identification data associated with the video data, and the viewing data include the video identification data.
The device (4) as claimed in one of the claims 1 to 5, wherein the device (4) includes a time determining module (43) which determines the current time, and the viewing data include time indications.
The device (4) as claimed in one of the claims 2 to 6, wherein the device (4) is designed as a mobile device, and the telecommunications network (3) is a mobile radio network via which mobile radio network (3) the device (4) is able to communicate.
this the 10th day of December, 2001.
(RANJNA MEHTA-DUTT) Of Remfry & Sagar Attorney for the Applicants

Documents:

in-pct-2001-01567-mum-cancelled pages(17-5-2005).pdf

in-pct-2001-01567-mum-claims(granted)-(17-5-2005).doc

in-pct-2001-01567-mum-claims(granted)-(17-5-2005).pdf

in-pct-2001-01567-mum-correspondence(17-5-2005).pdf

in-pct-2001-01567-mum-correspondence(ipo)-(6-11-2006).pdf

in-pct-2001-01567-mum-form 1a(10-12-2001).pdf

in-pct-2001-01567-mum-form 1a(17-5-2005).pdf

in-pct-2001-01567-mum-form 2(granted)-(17-5-2005).doc

in-pct-2001-01567-mum-form 2(granted)-(17-5-2005).pdf

in-pct-2001-01567-mum-form 3(10-12-2001).pdf

in-pct-2001-01567-mum-form 3(15-7-2002).pdf

in-pct-2001-01567-mum-form 3(17-5-2005).pdf

in-pct-2001-01567-mum-form-pct-isa-210(17-5-2005).pdf

in-pct-2001-01567-mum-petition under rule 137(17-5-2005).pdf

in-pct-2001-01567-mum-power of authority(10-12-1999).pdf

in-pct-2001-01567-mum-power of authority(17-5-2005).pdf


Patent Number 203652
Indian Patent Application Number IN/PCT/2001/01567/MUM
PG Journal Number 19/2007
Publication Date 11-May-2007
Grant Date 06-Nov-2006
Date of Filing 10-Dec-2001
Name of Patentee SWISSCOM MOBILE AG
Applicant Address SCHWARZTORSTRASSE 61, 3050 BERN, SWITZERLAND.
Inventors:
# Inventor's Name Inventor's Address
1 RUDOLF RITTER ROSSWEIDWEG 8, CH-3052 ZOLLIKOFEN, SWITZERLAND.
2 ERIC LAUPER HOCHFELDSTRASSE 96, CH-3012 BERN, SWITZERLAND.
PCT International Classification Number N/A
PCT International Application Number PCT/CH99/00268
PCT International Filing date 1999-06-18
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 NA