Title of Invention

"APPARATUS AND METHOD FOR CLASSIFYING A PLURALITY OF DATA FLOWS IN A ROUTER"

Abstract The present invention relates to a method and system for supporting in a router a plurality of data flows using a ternary content addressable memory (TCAM) in which the number of accesses to write to the TCAM is optimized to improve efficiency of updating and subsequent look up. To accommodate the plurality of data flows, the TCAM is partitioned into at least two partitions in which a first portion includes indices having a higher priority and a second portion includes indices having a lower priority. For example, multiple protocol label switching (MPLS) flows and IP-Virtual Private Network (VPN) can be added to the first partition and policy based routing flows can be added to the second partition. During subsequent TCAM look-up of a prefix of an incoming packet the MPLS or IP-VPN flow will subsume any matching policy based routing flow, such as flows classified by an access control list or traffic manager flows.
Full Text 1. Field of the Invention
This invention relates to an apparatus and method for classifying a plurality of data flows in a router and more particularly to selectively routing a plurality of data flows, such as, Multi-Protocol Label Switching ("MPLS"), Internet Protocol (IP) Virtual Private Network ("VPN") data packets and policy based routing data packets, using a ternary content addressable memory ("TCAM").
Background of the Invention
Network providers are interested in providing centralized network services to meet customer demands. By taking advantage of the latest advances in IP quality of service ("QoS"), multiprotocol label switching ("MPLS"), and service transformation technology (the conversion of non-IP services to IP), service providers can evolve dedicated IP infrastructures into a multi-service network architecture, as an alternative to operating separate service-specific networks.
MPLS is a standards-approved technology for speeding up network traffic flow and making it easier to manage. MPLS involves setting up a specific path for a given sequence of packets, identified by a label put in each packet, thereby saving the time needed for a router to look up the address to the next node. MPLS is called multiprotocol because it works with the Internet Protocol ("IP"), Asynchronous Transport Mode ("ATM"), and various frame relay network protocols. MPLS allows most packets to be forwarded at the layer 2 (switching) level of the standard Open Systems Interconnection ("OSI") rather than at the layer 3 (routing) level. In addition to moving traffic faster overall, MPLS makes it easy to manage a network for quality of service ("QoS"). For these reasons, the technique is expected to be readily adopted as networks begin to carry more and different mixtures of traffic.
The essence of MPLS is the generation of a short fixed-length"label"that acts as a shorthand representation of an IP packet's header and the use of that label to make forwarding decisions about the packet. Typically, IP data packets are routed from source
to destination through a series of routers which receive the IP data packet, read the source and/or destination addresses and re-transmit the IP data packet either to the destination indicated as indicated by the IP destination addressed contained in the IP data packet or to another router which will forward the IP data packet until the IP data packet reaches the destination address, referred to as hop by hop routing. IP packet headers have fields for IP source and/or destination addresses. Routing protocols such as Routing Information Protocol ("RIP") and Open Shortest Path First ("OSPF") enable each machine to understand which other machine in the "next hop" that a packet should take toward its destination.
In MPLS, the IP packets are encapsulated with labels by the first MPLS device they encounter as they enter the network. The MPLS edge router analyses the contents of ^the IP header and selects an appropriate label with which to encapsulate the packet. In contrast to conventional IP routing, the router analysis can be based on more than just the destination address carried in the IP header. At all the subsequent nodes within the network the MPLS label, and not the IP header, is used to make the forwarding decision for the packet. As MPLS labeled packets leave the network, another edge router removes the labels. In MPLS terminology, the packet handling nodes or routers are called Label Switched Routers (LSRs). MPLS routers forward packets by making switching decisions based on the MPLS label. There are two broad categories of LSR: MPLS edge routers, which are high performance packet classifiers that apply (and remove) the requisite label at the edge of the network; and Core LSRs which are capable of processing the labeled packets at extremely high bandwidths.
Traditional routing solutions for efficient use of IP addressing have included using a content addressable memory (CAM) device for storing IP addresses. A CAM is a storage device that can be instructed to compare a specific pattern of comparand data with data stored in its associative CAM array. The entire CAM array, or segments thereof, are searched in parallel for a match with the comparand data. If a match exists, the CAM device indicates the match by asserting a. match flag. Multiple matches may also be indicated by asserting a multiple match flag. The CAM device typically includes a priority encoder to translate the highest priority matching location into a match address or CAM index. The generally fast parallel search capabilities of CAMs have proven
useful in many applications including address filtering and lookups in routers and networking equipment, policy enforcement in policy-based routers, pattern recognition for encryption/decryption and compression/decompression applications, and other pattern recognition applications.
Binary CAM cells are able to store two states of information: a logic one state and a logic zero state. Binary CAM cells typically include a RAM cell and a compare circuit. The compare circuit compares the comparand data with data stored in the RAM cell and drives a match line to a predetermined state when there is a match. Columns of binary CAM cells may be globally masked by mask data stored in one or more global mask registers. Ternary CAM cells are mask-per-bit CAM cells that effectively store three states of information, namely: a logic one state, a logic zero state, and a don't care state J for compare operations. Ternary CAM cells typically include a second RAM cell that stores local mask data for the each ternary CAM cell. The local mask data masks the comparison result of the comparand data with the data stored in the first RAM cell such that the comparison result does not affect the match line. The ternary CAM cell offers more flexibility to the user to determine on an entry-per-entry basis which bits in a word will be masked during a compare operation.
U.S. Patent No. 6,237.061 describes a system in which Classless Inter-Domain Routing (CIDR) addresses are pre-sorted and loaded into the ternary CAM such that the CAM entry having the longest prefix is located at the highest numerical address or index. The prefix portions of the CIDR addresses are used to set the masks cells associated with each CAM entry such that during compare operations, only the unmasked prefix portion of each CAM entry, which may correspond to a network ID field, is compared to an incoming destination address stored as the CAM search key. Since each CAM entry is masked according to an associated prefix value, the ternary CAM requires only one search operation to locate the CAM entry having the longest matching prefix.
Some other network services which are offered by network providers include Internet Protocol (IP) Virtual Private Networks (VPN) to interconnect various customer sites that are geographically dispersed. VPNs offer privacy and cost efficiency through network infrastructure sharing. U.S. patent No. 6,205,488 describes a virtual private
network including multiple routers connected to a shared MPLS network which are configured to dynamically distribute VPN information across the shared MPLS network.
Policy-based routing services have also been described to allow customers to implement policies that selectively cause packets to take different paths. Conventional applications of policy based routing have included: source based transit provider selection for routing traffic originating from different sets of users through different Internet connections across the policy routers; quality of service (QOS) for prioritizing traffic based on the type of service; and cost savings for distributing traffic between low-bandwidth, low cost permanent paths and high-bandwidth, high cost, switched paths.
It is desirable to provide a method and system having fast search capabilities through use of a TCAM for classifying a plurality of types of data traffic and route lookup.
Summary of the Invention
The present invention relates to a method and system for supporting a plurality of data flows in a router using a ternary content addressable memory (TCAM) in which the number of accesses to the TCAM is optimized to improve efficiency of updating and subsequent look up. To accommodate the plurality of data flows, the TCAM is partitioned into at least two partitions in which a first portion includes indices having a higher priority and a second portion includes indices having a lower priority. For example, multiple protocol label switching (MPLS) flows and IP-Virtual Private Network (VPN) can be added to the first partition and policy based routing flows can be added to the second partition. During subsequent TCAM look-up of a predetermined prefix of an incoming packet the MPLS or IP-VPN flow will subsume any matching policy based routing flow, such as flows classified by an access control list or traffic manager flows.
In the case of MPLS and IP-VPN flows, flows classified by connection index (CIX) and destination IP address (DA) and flows classified by CIX only are added from the top of the first partition of the TCAM and flows classified by DA only are added from the bottom of the first partition. This arrangement has the advantage that CLX and DA flows and CLX only flows subsume DA only flows at higher indices and CIX and DA flows and CDv only flows are separated from DA only flows to optimize the number of
swaps needed when adding a new flow. To reduce the number of writes to the TCAM, a flow index space is used having entries corresponding to the TCAM space. Swaps are performed in the index space and only the changed entries are written to the TCAM. The invention will be more fully described by reference to the following
drawings.
Brief Description of the Drawings
Fig. 1 is a high-level functional block diagram of a system architecture for classifying flows in a router in accordance with the teachings of the present invention.
Fig. 2 is a schematic diagram of implementation of a flow classifier and flow manager.
Fig. 3 is a schematic diagram of a TCAM flow entry.
Fig. 4A is a schematic diagram of a prefix tree for storing flows classified by a connection index (CIX).
Fig. 4B is a schematic diagram of a prefix tree for storing flows classified by a destination address (DA).
Fig. 5 A is a schematic diagram of data organization of a flow TCAM for MPLS and IP-VPN flows classified by CIX and DA before addition of the flow when no DA flow is present.
Fig. 5B is a schematic diagram of data organization of a flow TCAM for MPLS and IP-VPN flows classified by CIX and DA after addition of the flow when no DA flow is present.
Fig. 5C is a schematic diagram of data organization of a flow TCAM for MPLS and IP-VPN flows classified by CIX and DA before addition of the flow when DA flow is present.
Fig. 5D is a schematic diagram of data organization of a flow TCAM for MPLS and IP-VPN flows classified by CLX and DA after addition of the flow when DA flow is present.
Fig. 6A is a schematic diagram of data organization of a flow TCAM for MPLS and IP-VPN flows classified by DA before addition of the flow when no CLX, DA or CIX flows are present.
Fig. 6B is a schematic diagram of data organization of a flow TCAM for MPLS and IP-VPN flows classified by DA after addition of the flow when no CIX, DA or CIX flows are present.
Fig. 6C is a schematic diagram of data organization of a flow TCAM for MPLS and IP-VPN flows classified by DA before addition of the flow when CIX, DA or CIX flows are present
Fig. 6D is a schematic diagram of data organization of a flow TCAM for MPLS and IP-VPN flows classified by DA after addition of the flow when CIX, DA or CIX flows are present.
Fig. 7 is a schematic diagram of data organization of a flow TCAM for policy based routing flows.
Detailed Description
Reference will now be made in greater detail to a preferred embodiment of the invention, an example of which is illustrated in the accompanying drawings. Wherever possible, the same reference numerals will be used throughout the drawings and the description to refer to the same or like parts.
Referring to Fig. 1 there is shown a high-level functional block diagram of the system architecture for classifying and routing flows in a router 10 in accordance with the teachings of the present invention. A flow is a set of data packets that obey a rule or policy identified from the content of the packet header fields of the data packets. The packet header fields can include for example the source IP address, destination IP address, source port, destination port, protocol identification, type of service (TOS), connection index (CIX) and other fields. The architecture comprises three major elements, control plane 12, data plane 13 and layer 2 interface 14. The interaction between the various elements is represented by the series of arrows between corresponding elements. Control plane 12 which can be implemented in software is comprised of flow manager 15, data plane control interface 16, flow core control 17 and . IP, User Datagram Protocol ("UDP") and Transmission Control Protocol ("TCP") 18. Data plane 13 which can be implemented in hardware is comprised of flow classifier 20, IP forwarder 21 and label forwarder 22. IP traffic and IP control traffic 23 is received at
flow classifier 20. Flow classifier 20 interacts with flow manager 15 and flow core control 17 for classifying and routing IP traffic and IP control traffic 23 and applying destination routes through label forwarder 22, in the case of MPLS flows, or IP forwarder 21 in the case of non-MPLS flows. Flow core control 17 can comprise software modules such as, for example, TEP, red manager, label manager, route watch, routing manager and FIB and an IP routing data base. While the present invention is particularly well suited for use with the AmberNetwork ASR 2000 and ASR 2020 devices as described herein, it is equally suited for use with other routers having similar capabilities and features. The AmberNetwork ASR 2000 and ASR 2020 technical manuals are incorporated herein by reference as if fully set out.
Fig. 2 is a schematic diagram of an example implementation of flow classifier 20 and flow manager 15. In this embodiment, flow classifier 20 comprises flow ternary content addressable memory (TCAM) 30. Flow TCAM 30 is a hardware memory device where all entries in the TCAM are compared in parallel against incoming packet header fields and the first matching entry is selected in a single clock cycle. A suitable TCAM is manufactured by Lara Technology Inc., San Jose, California and as described in U.S. Patent No. 6,081,440 hereby incorporated by reference into this application. Each flow TCAM 30 entry is addressed or indexed by indices 32. Indices 32 can be an index or numerical address. Indices 32 are arranged from lowest index 32a to highest index 32n with priority being greatest at lowest index 32a and being least at highest index 32n.
Fig. 3 illustrates a representative TCAM flow entry 33 to be stored in flow TCAM 30. A local mask 34 is associated with each TCAM flow entry 33 for effectively storing in flow TCAM 30 either a logic 0, a logic 1, or a don't care for a flow TCAM look up operation. For example, if a bit of local mask 34 is a logic 1, the corresponding bit of TCAM flow entry 33 is compared to a corresponding bit of an incoming data packet during a subsequent flow TCAM look up operation. Conversely, if local mask 34 is a logic 0, the corresponding bit of TCAM flow entry 33 is not compared during a subsequent flow TCAM look up operation. Alternatively, in other embodiments of the present invention the mask bit scheme can be inverted such that a mask bit is equal to logic 1, the corresponding bit of the TCAM flow entry is masked and if a mask bit is equal to a logic 0 the corresponding bit of the TCAM flow entry is compared. A prefix
can be associated with one or more of the fields in flow TCAM entry 33, such as the destination IP address, to indicate the number of bits of the destination IP address of the packet header to be matched in flow TCAM 30. In a subsequent flow TCAM look up operation, if there is a match between the unmasked flow TCAM entry and the predetermined prefix corresponding to the incoming packet header bits, the index of the matching TCAM flow entry 33 as well as any routing data stored in flow TCAM 30 or in an associated external memory such as for instance, an SRAM, is provided as output.
Flow manager 15 is used to provide data structure organization of flow TCAM 30. Referring to Fig. 2, flow manager 15 can partition indices 32 into one or more logical partitions. Flows are assigned to partitions depending on a desired priority for the type of flow. In this embodiment, indices 32 are partitioned into partition 36a which partition includes lowest index 32a and partition 36b which partition includes highest index 32n. A FTCAM_ Partition index is located between partition 36a and partition 36b. In the embodiment shown in Fig. 2, MPLS and IP-VPN flows are determined to have the highest priority and are assigned to partition 36a. Policy-based routing flows are determined to have lower priority and are assigned to partition 36b. Policy based routing flows can include data classified by Access Control Lists (ACL) flows and traffic manager (TE) flows. Accordingly, MPLS flows and IP-VPN flows which have been assigned higher priority will be found in a subsequent lookup in flow TCAM 30 before ACL flows and TE flows which have been assigned a lower priority and MPLS flows or IP VPN flows will subsume any matching ACL flows and TE flows in flow TCAM 30.
Flow index space 38 can be maintained in flow manager 15 to correspond to data organization of flow TCAM 30. All flow swapping can be performed in flow index space 38 and only the changed entries are written to the flow TCAM 30.
In an embodiment of the present invention, an array of pointers and prefix trees are used to store MPLS and IP-VPN flows in flow index space 38, as shown in Figs. 4A and 4B. Flows which are classified by connection index CIX and destination IP address (DA) fields of the packet header, are stored in CIX prefix tree 40. Each connection index (CIX1-CIX16K) is associated with node 41a-41n of prefix tree 40. A destination IP address based lookup is performed to find the longest match of a prefix stored in a respective node 41a-41n. Flows are maintained in order to match the correct flow during
flow TCAM 30 look up. A variable gMaxCixDaFix is used in flow index space 38 to indicate the maximum flow TCAM Index of the CIX and DA flows and CIX only flows. Flows which are classified by destination IP address only are stored in DA prefix tree 42. Each DA is associated with node 44 of prefix tree 42.
A variable gMinDaOnlyFix is used in flow index space 38 to indicate the minimum flow TCAM index for DA only flows
A software module can be implemented in flow manager 15 for MPLS and IP-VPN flow organization of TCAM 30. A representative software module is illustrated in Table 1.
Table 1
(Table Removed)
Figs. 5A-5D illustrate an example of data organization of flow TCAM 30 for MPLS and IP-VPN flows. Flows are maintained in order to match the correct flow during flow TCAM 30 look up. Partition 36a is divided into lower index portion 50a and higher index portion 50b. Lower index portion 50a corresponds to a lower index or
address range and higher index portion 5 Ob corresponds to a higher index or address range. Flows which are to be classified by the connection index (CIX) and Destination IP Address (DA) fields of the packet header, referred to as CDC, DA, are assigned to lower index portion 50a. Flows which are classified only by the CIX of the packet header are also assigned to lower index portion 50a. Flows which are classified only by the DA of the packet header are assigned to higher index portion 50b. Local mask 34 can be applied to each flow TCAM entry 33 to effectively store the particular type of data flows, such as the above-described CDC, DA flows, CIX only flows and DA only flows, for use in compare operations of flow TCAM 30. For example, CDC only flows can occur when local mask bits of the DA are zero and local mask bits of the CIX are all one.
During adding of flows classified by CIX, DA or CDC only to TCAM 30, a free entry in TCAM 30 is searched from lowest index 32a of lower index portion 50a. The free entry is referred to as Fix. During adding of flows classified as DA flows, a free entry in TCAM 30 is searched from highest index 32b of highest index portion 50b. An index corresponding to a maximum value of lowest index portion 50a is established as gMaxCixDaFix and an index corresponding to minimum value of a highest index portion 50b is established as gMinDaOnlyFix. In this manner, maximum free space 54 is achieved between lower index portion 50a and higher index portion 50b, thereby maintaining the CDC, DA flows and CIX only flows together and the DA only flows together and separately the CIX, DA flows and CIX only flows from the DA flows. During deletion of flows classified by CIX, DA or CDC only from TCAM 30, the entry at a corresponding index 32 is invalidated in flow space 38. Thereafter, during subsequent adding of flows classified by CDC, DA or CDC only, the invalidated entry is found during a search for free entries from lowest index 32a of lower index portion 50a the flow is added to re-use the previously invalidated entry. Accordingly, only if TCAM 30 is substantially at capacity will it be necessary to swap a DA only flow to insert a CIX, DA or CIX only flow or to swap a CIX, DA flow or CIX only flow to insert a DA only flow.
Figs. 5A-5B illustrate assignment of CIX, DA flows and CIX only flows if no DA only flows exist or a free TCAM entry, Fix, is above the DA only flows at a lower index value than gMinDaOnlyFix. The gMaxCixDaFix index entry is set immediately after the index corresponding to Fix. Figs. 5C-5D illustrate assignment of CIX, DA and CIX only
flows if there are DA only flows present or a free TCAM entry, Fix, is between the DA only flows. In this embodiment, TCAM 30 is almost full. There exists no free entries from lowest index 32a past gMaxCixDaFix and gMinDaOnlyFix indices. Accordingly, the gMaxCixDaFix and gMinDaOnlyFix indices are adjacent indices. A free entry is available between the index of gMinDaOnlyFix and highest index 32b. For example, the free entry can occur in the Da flow space because of an earlier deletion of a DA flow. In order to use the free entry, Fix, for a flow classified by CIX, DA or CIX only, the DA flow at the gMinDaOnlyFix index is moved into Fix, thereby making the gMinDaOnly Fix index available. The flow classified by CIX, DA or CIX only is written at the current index for gMinDaOnlyFix. The gMaxCixDaFix index is set at the written TCAM entry for the flow classified by CDC, DA or CIX only and the gMinDaOnlyFix entry is set . immediately after the written TCAM entry. The other CIX, DA and CIX only flows between lowest index 32a and the gMaxCixDaOnlyFix index in TCAM 30 are adjusted for proper subsuming ordering. The other DA only flows between the gMinDaOnlyFix index and highest index 32b are adjusted for proper subsuming ordering.
A software module can be implemented in flow manager 15 for adding CDC, DA flows and CIX only flows to TCAM 30. A pointer to the current flow is referred to as pflow. A pointer to the free entry is referred to as fix. The TCAM flow entry 33 is written to flow TCAM 30 by an AdjustAndWriteCixDA(pflow, fix) function, described below in order to adjust the writing at TCAM flow entry 33 into flow TCAM 30 based on local mask 34 of other DAs in the same CIX. A representative software module is illustrated in Table 2.
Table 2
(Table Removed)
AdjustAndWriteCixDA ( pFlow, Fix) / take care of subsuming issues with other DAs in same Cix, based on subnet masks/
Set gMaxCixDaFix return
else
4. EndofinsertCixDaFlow
During inserting of CIX, DA flows, CIX only flow and DA only flows the flows in flow TCAM 30 are adjusted such that flow TCAM 30 is ordered to have the TCAM entry with the longest prefix located at the index having highest priority which is the lowest index or lowest numerical value and the TCAM entry followed by decreasing prefix values with the shortest prefix is located at the index having lowest priority which is the highest index or highest numerical value. Tables 3 and 4 illustrate respective software modules which can be implemented in flow manager 15 for adjusting and writing DA only flows and adjusting and writing Fix and DA flows and which modules are used in the software module illustrated in Table 1.
Table 3(Table Removed)
Table 4
1. Begin adjustAndWriteCixDA (pFlow, Fix)
2. Using the mask length of the destination IP address in the flow, first fix the
shorter-prefix flow in prefix tree 40. If a shorter prefix node is found and the index of the
found node is less than the index of the pFlow node, swap the two flows and write only
the second flow to flow TCAM 30. Then continue search with the removed flow to
locate routes that are subsumed.
3. Write the last best flow into its correct location and remember this so that it
doesn't have to be re-written again below.
4. At this point pFlow is pointing to the shortest-prefix flow whose index had to
be adjusted to follow LPM property and that matched the original flow that had to be
inserted in flow TCAM 30.
5. Fix the longer-prefix flows in TCAM 30. Starting from mask length 32 and
going downwards to current mask length, find largest flow index flow that gets
subsumed.
6. If the found flows flow index is greater than the index of current flow, it means
that a flow with a longer prefix to the same destination is before the current one which
has a shorter prefix. In this case swap the two flows in the TCAM 30 and fix the index
values in the flows. Write the second flow to TCAM 30.
7. Write the last best flow into its correct location. If this is same flow as that
already written in step 3 above, TCAM 30 is not written again.
8. End of AdjustAndWriteCIXDA(pFlow, Fix).
(Table Removed)
Figs. 6A-6B illustrate assignment of DA only flows if the first free TCAM entry, Fix, is located after both CIX, DA flows and CDC only flows or if there are no CDC, DA flows. The gMinDaOnlyFix index entry is set at the index corresponding to Fix. Figs. 6C-6D illustrate assignment of DA flows if the first free TCAM entry, Fix, is between CIX, DA or CIX only flows. In this embodiment, TCAM 30 is almost full. There exists no free entries from highest index 32b past gMinDaOnlyFix and gMaxCixDaFix. Accordingly, the gMaxCixDaFix and gMinDaOnlyFix indices are adjacent indices. A free entry is available between the index of gMaxCixDaFix and lowest index 32a. For example, the free entry can occur in CDC, DA and CIX only flow space because of an earlier deletion of a CDC, DA or CIX only flow. In order to use the free entry, Fix, for a flow classified by DA only, the CIX, DA or CDC only flow at the gMaxCixDaFix index is moved into Fix, thereby making the gMaxCixDaFix index available. The flow classified by DA is written at the current index for gMaxCixDaFix. The gMinDaOnlyFix entry is set at the written TCAM entry and the gMaxCixDaFix entry is set immediately before the written TCAM entry. The other DA flows between highest index 32b and the gMinDaOnlyFix index are adjusted for proper subsuming ordering. The other CDC, DA and CIX only flows between gMaxCixDaFix and lowest index 32a are adjusted for proper subsuming ordering.
A software module can be implemented in flow manager 15 for adding DA flows to TCAM 30. A representative software module is illustrated in Table 5.
Table 5
(Table Removed)
The clients of flow manager 15 are responsible for removing flows in TCAM 30 if an interface goes down. Flow manager 15 provides an Application Programming Interface
(APIs) to withdraw routes based on the application handle. For example, if an IP circuit goes down the connection manager informs the IP task and the VPN manager receives this alarm. The VPN manager in turn withdraws the routes from flow TCAM 30 based on the circuit identifiers.
A software module can be implemented in flow manager 15 for removing flows in TCAM 30. A representative software module is illustrated in Table 6.
Table 6
(Table Removed)
ACL flows and traffic manager (TE) flows are internally stored in a flow index space corresponding to the Flow TCAM by the Flow Manager 15, as shown in Fig. 7. The ACL flows and TE flows are strictly ordered based on the command line interface (CLI) defined access control lists (ACLs). ACLs are typically applied to network interfaces to permit or deny certain kinds of network traffic. All packets matching a particular ACL flow will be allowed to pass through and a network route is determined. All packets not matching the ACL flow will be dropped or a policing or shaping of type of service ("TOS") operation will be performed on the packets. A global access-list is used at all interfaces.
The ACL and TE flows are maintained in order when added to flow TCAM 30. Flows are added to the next available index entry located in flow TCAM 30 starting from top 60 of partition 36b. Partition 36b is further subdivided into portions 62a and 62b.
Portion 62a is used for ACLs applied to interfaces and portion 62b is used for global ACLs which will be used if no other ACL matches. A GACLJPARTITION variable can be used to define the partition size of portion 62a and 62b. A gMaxACLFix variable defines a maximum flow TCAM index for ACL and TE flows in portion 62a. A gGlobalACLFix variable defines a maximum flow TCAM index for Global ACL & TE flows in portion 62b.
Policy based ACL and TE flows are added at the location of the gMaxACLFix variable and the gMaxACLFix variable is incremented. If gMaxACLFix becomes equal to the GACL_PARTITION variable, portion 62b is full and no more ACL flows can be added until some flows are deleted. An ACL flow can specify a range of source or destination ports. The ACL flow that specifies a range of source or destination ports is mapped to multiple flows, with a local mask 34 to cover a portion of the range. Accordingly, the optimal number of flows with different masks are determined to cover the specified range. For the flows which map to multiple flows in the TCAM, an application programming interface (API) can create peer flows with an assigned local mask 34 and add the peer flows along with the parent flow to flow TCAM 30 which flows can be managed by flow manager 15.
Global ACL flows are added at gGlobalACLFix variable and then the gGlobalACLFix variable is incremented. If the gGlobalACLFix variable becomes equal to a FM_MAX_FIX variable, then no more Global ACL flows can be added until some flows are deleted from TCAM 30.
Flow manager 15 includes software modules which are responsible for removing flows from TCAM 30. For single flow deletion, the flow will be removed from flow index space 38 and is invalidated in flow TCAM 30. First API 66 is used to delete a single flow from TCAM 30. If the single flow has peer flows all of the peers will also be deleted. Flows remaining in flow TCAM 30 are compacted immediately in order to fill up the vacant flow space. All flows after the deleted flow are moved up by one index and are written to TCAM 30. The value of the gMaxACLFix variable is adjusted accordingly.
For multiple flow deletion, all flows in the supplied flow list will be removed and then compaction will be performed on remaining flows. A second API 67 is used to
delete a list of flows for deleting multiple flows from TCAM 30. The first empty flow space is filled first by the next available occupied flow and this is repeated until all flows are compacted such that all empty flow spaces before the gMaxACLFix variable are filled up. The value of the gMaxACLFix variable is adjusted accordingly.
In view of the foregoing description, numerous modifications and alternative embodiments of the invention will be apparent to those skilled in the art. It should be clearly understood that the particular exemplary computer code can be implemented in a variety of ways in a variety of languages, which are equally well suited for a variety of hardware platforms.
It is to be understood that the above-described embodiments are illustrative of only a few of the many possible specific embodiments which can represent applications of the principles of the invention. Numerous and varied other arrangements can be readily devised in accordance with these principles by those skilled in the art without departing from the spirit and scope of the invention.



We claim:-
1. An apparatus for classifying a plurality of data flows in a routing system
comprising:
a ternary content addressable memory (TCAM) (30);
a partitioning unit (15) to partition said TCAM into at least a first partition and a second partition, said first partition comprises indices having highest priority ranging from a lowest index to a partition index and said second partition comprises indices having lowest priority ranging from a highest index to said partition index;
a selector to select a respective mask value to structure one or more first flow TCAM entries of a first of said data flows and one or more second flow TCAM entries and said respective mask values into said second partition; and
a comparator to perform an associative comparison of a prefix comprising predetermined packet header information of an incoming packet to said loaded one or more first flow TCAM entries and one or more second flow TCAM entries of a first of said plurality of data flows into said first partition in a predetermined order such that a matching said one or more first TCAM entries subsumes any matching said one or more second TCAM entries;
2. The apparatus as claimed in claim 1 wherein said first plurality of data flows are multiprotocol label switching (MPLS) or internet protocol virtual private network (IP-VPN) flows.
3. The apparatus as claimed in claim 1 wherein said second plurality of data flows are policy based routing flows.
4. The apparatus as claimed in claim 1 wherein said partitioning unit and said selector are implemented in flow index space having entries corresponding to said TCAM and said predetermined order of said first TCAM entries and said predetermined order of said second TCAM entries is represented in said flow index space.

5. The apparatus as claimed in claim 2 wherein said MPLS or IP-VPN flows are classified by connection index (CIX) and destination address (DA), CIX only or DA only.
6. The apparatus as claimed in claim 5 wherein said first partition is divided into a first portion and a second portion, said first portion comprises indices having highest priority ranging from said lowest index to a gMaxCixDaFix index and said second portion comprises indices having lowest priority ranging from a gMinDaOnlyFix index to said partition index and said flows classified by CIX and DA and CIX only are assigned to said first portion and said flows classified by DA only are assigned to said second portion.
7. The apparatus as claimed in claim 6 wherein said flows classified by CIX and DA and flows classified by CIX only are loaded into said first portion starting at a beginning of said TCAM and ending at said gMaxCixDaFix index and said flows classified by said DA only are loaded in said second portion starting at said gMinDaOnlyFix index.
8. The apparatus as claimed in claim 1 wherein said predetermined order of said first partition has one of said TCAM entries with a longest prefix located at an index having highest priority followed by decreasing prefix values with a shortest prefix at an index having lowest priority.
9. The apparatus as claimed in claim 3 wherein said second partition is divided into a first portion and a second portion, said first portion comprises indices having highest priority ranging from a lowest index in said second partition to a GACL-partition index and said second portion comprises indices having lowest priority ranging from said GACL-partition index to said highest index in said second partition, ACL flows are assigned to said first portion and global access list flows are assigned to said second portion.

10. The apparatus as claimed in claim 9 wherein said ACL flow is to specify specifies a range of source or destination ports, and to map said ACL flow to multiple flows in said TCAM wherein said mask for each of said multiple flows covers a portion of said range of source or destination ports.
11. A method for classifying a plurality of data flows in a router comprising:
partitioning a ternary content addressable memory (TCAM) into at least a first partition and a second partition;

said first partition comprises indices having highest priority ranging from a lowest index to a partition index and said second partition comprises indices having lowest priority ranging from a highest index to said partition index;
loading one or more first flow TCAM entries of a first of said plurality of data flows into said first partition in a predetermined order;
loading one or more second flow TCAM entries of a second of said plurality of data flows into said second partition in a predetermined order;
setting bit values of a corresponding mask for each of said first TCAM entries and said second TCAM entries such that bits of said respective first TCAM entries and said second TCAM entries are individually masked by said masks; and
comparing a prefix comprising predetermined packet header information of an incoming packet to said loaded one or more first TCAM entries and one or more second TCAM entries such that a matching said one or more first TCAM entries subsumes any matching said one or more second TCAM entries.
12. The method as claimed claim 11 wherein said first plurality of data flows are multiprotocol label switching (MPLS) or internet protocol virtual private network (IP- VPN) flows.
13. The method as claimed in claim 11 wherein said second plurality of data flows are policy based routing flows.
14. The method as claimed in claim 13 wherein said policy based routing flows are access control list (ACL) flows.
15. The method as claimed in claim 13 wherein said policy based routing flows are traffic manager flows.
16. The method as claimed in claim 11, wherein, said TCAM maintaining a flow index space having entries; and determining said predetermined order of said first TCAM entries and said predetermined order of said second TCAM entries in said flow index space before said steps of loading said one

or more first TCAM entries and loading said one or more second TCAM entries.
17. The method as claimed in claim 12 wherein said MPLS or IP-VPN flows are classified by connection index (CIX) and destination address (DA), CIX only or DA only.
18. The method as claimed in claim 17 wherein said first partition is divided into a first portion and a second portion, said first portion comprises indices having highest priority ranging from said lowest index to a gMaxCixDaFix index and said second portion comprises indices having lowest priority ranging from a gMinDaOnlyFix index to said partition index and said flows classified by CIX and DA and CIX only are assigned to said first portion and said flows classified by DA only are assigned to said second portion.
19. The method as claimed in claim 18 wherein said flows classified by CIX and DA and said flows classified by CIX only are loaded into said first portion starting at said gMaxCixDaFix index and said flows classified by said DA only are loaded in said second portion starting at said gMinDaOnlyFix index.
20. The method as claimed in claim 16 wherein maintaining a flow index space having entries corresponding to said TCAM; assigning said MPLS or IP-VPN flows classified by CIX and DA and CIX only to a CIX prefix tree; and assigning said MPLS or IP-VPN flows classified by DA only to a DA prefix tree.
21. The method as claimed in claim 16 wherein removing a flow in said TCAM by freeing up a corresponding entry in said flow index space and invalidating a corresponding TCAM entry.
22. The method as claimed in claim 11 wherein said predetermined order of said first partition has one of said TCAM entries with a longest prefix located at an index having highest priority followed by decreasing prefix values with a shortest prefix at an index having lowest priority.
23. The method as claimed in claim 13 wherein said second partition is divided into a first portion and a second portion, said first portion comprises indices having highest priority ranging from a lowest index in said second partition to a GACL-partition index and said second portion comprises

indices having lowest priority ranging from said GACL-partition index to said highest index, said ACL flows are assigned to said first portion and global access list flows are assigned to said second portion.
24. The method as claimed in claim 23 wherein said ACL flow specifies a range of source or destination ports, wherein, mapping said ACL flow to multiple flows in said TCAM wherein said mask for each of said multiple flows covers a portion of said range of source or destination ports for optimally determining a number of flows to cover said portion of said range.
25. The method as claimed in claim 23 or 24 wherein maintaining a flow index space having entries corresponding to said TCAM.
26. The method as claimed in claim 25 wherein removing said multiple flows in said TCAM by freeing up corresponding said entries in said flow index space and invalidating said corresponding entries in said TCAM entry; and compacting said flows in said TCAM by moving each remaining said TCAM entry up by one or more indices.
27. A method substantially as herein described with reference to the foregoing description, tables and the accompanying drawings.
28. An apparatus as herein described with reference to the foregoing description, tables and the accompanying drawings.




Documents:

358-DELNP-2003-Abstract-(17-06-2008).pdf

358-delnp-2003-abstract.pdf

358-delnp-2003-assignment.pdf

358-DELNP-2003-Claims-(08-04-2009).pdf

358-DELNP-2003-Claims-(17-06-2008).pdf

358-DELNP-2003-Claims-(31-03-2009).pdf

358-delnp-2003-claims.pdf

358-DELNP-2003-Correspondence-Others-(01-04-2009).pdf

358-DELNP-2003-Correspondence-Others-(08-04-2009).pdf

358-DELNP-2003-Correspondence-Others-(17-06-2008).pdf

358-DELNP-2003-Correspondence-Others-(23-06-2008).pdf

358-DELNP-2003-Correspondence-Others-(31-03-2009).pdf

358-delnp-2003-correspondence-others.pdf

358-DELNP-2003-Description (Complete)-(08-04-2009).pdf

358-delnp-2003-description (complete)-17-06-2008.pdf

358-delnp-2003-description (complete).pdf

358-DELNP-2003-Drawings-(17-06-2008).pdf

358-delnp-2003-drawings.pdf

358-DELNP-2003-Form-1-(08-04-2009).pdf

358-delnp-2003-form-1.pdf

358-delnp-2003-form-18.pdf

358-DELNP-2003-Form-2-(08-04-2009).pdf

358-DELNP-2003-Form-2-(17-06-2008).pdf

358-DELNP-2003-Form-2-(31-03-2009).pdf

358-delnp-2003-form-2.pdf

358-DELNP-2003-Form-3-(23-06-2008).pdf

358-delnp-2003-form-3.pdf

358-delnp-2003-form-5.pdf

358-delnp-2003-gpa.pdf

358-delnp-2003-pct-101.pdf

358-delnp-2003-pct-210.pdf

358-delnp-2003-pct-220.pdf

358-delnp-2003-pct-301.pdf

358-delnp-2003-pct-304.pdf


Patent Number 233892
Indian Patent Application Number 358/DELNP/2003
PG Journal Number 18/2009
Publication Date 01-May-2009
Grant Date 16-Apr-2009
Date of Filing 11-Mar-2003
Name of Patentee NOKIA INC.
Applicant Address 313 FAIRCHILD DRIVE, BUILDING A, MOUNTAIN VIEW, CA 94043, U.S.A.
Inventors:
# Inventor's Name Inventor's Address
1 MATE, ASHUTOSH 492 S. DEMPSEY ROAD #292, MILPITAS, CA 95035, U.S.A.
2 MAHAMUNI, ATUL 4260 ALBANY DRIVE #1-309, SAN JOSE, CA 95129, U.S.A.
3 CHANDER, VIJAY 400 RIVERSIDE COURT #108,, SANTA CLARA, CA 95054, U.S.A.
PCT International Classification Number G06F 15/173
PCT International Application Number PCT/US02/21229
PCT International Filing date 2002-07-03
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 09/909,739 2001-07-20 U.S.A.