[ Home ] [ Table of Contents ] [ About Lee Goeller ] [ Search ]

Goeller on Telecom Traffic

(Lee Goeller writes: I combined material from my several articles into a book which BCR published in 1983: "A First Course in Telephone Traffic Engineering." So far as I know, it is the first book to include a floppy disk with all my traffic programs, somewhat expanded for general use, included. I wrote the book to work with the programs, and the programs to work with the book. The text of the book is included here, without the disk and some of the material generated by programs in BASIC rather than a word processor. )

A First Course in
Telephone Traffic Engineering

Table of Contents

(Webmaster's Note: Clicking on "Figure" links will open the Figure in a separate window.)

Chapter 1: Traffic Handling Systems, Basic assumptions

How To Use This Course

This course is based on the assumption that you want to learn how to use telephone traffic theory to design communication networks of the sort used by businesses, institutions, governments, and even telephone companies. As a result, it does not follow the typical pattern which requires the memorization of formulas well known for 70 or more years and which, without a computer, are almost completely unusable. (Appendix I contains some of the simpler formula derivations, along with references for further reading). Rather, it substitutes a floppy disk on which these formulas are programmed to be ready for immediate use. As you read through the text, call up the programs where indicated, run them, and see what happens. Appendix IV contains suitable instructions.

If you do not have access to a computer, or are reading on an airplane or in the comfort of your living room chair, sample runs for specific programs, similar to those you would see on the screen, are also listed in Appendix IV. Further, sample traffic tables are given in Appendix II for quick reference. However, it is strongly recommended that you get in the habit of calling up and running specific programs as you need them.

For those interested in the programs themselves, Appendix III contains an article I wrote several years ago to illustrate my approach to the problem. Even the simplest of the traffic formulas are seen there to be quite complex; I feel they have to be considered a sort of flowing process rather than a single calculation. Viewed in that light, the program listings are deceptively brief, considering what they do. It is interesting to note that even Bell Labs, as of 1953, did not have tables of the Erlang C distribution; now anybody with a toy computer can generate them at will, and, indeed, many people all across the country are using the programs from my article today.

The programs on the disk that comes with this course are almost all somewhat more involved, and reflect additional needs and requirements that I have found necessary in my own consulting practice. Further, they deal better with some of the strange behavior of computers which do not like very large or very small numbers or have other eccentricities. Computers are strange machines, but I often wonder how anybody worked in traffic before they became generally available. Without a computer to provide numbers, even a perfect understanding of theory and the derivation of the formulas is almost useless. With a computer, even those of us who are not PhDs in mathematics can design quite respectable networks and save our employers and clients large sums of money.

But what is a good network? Anyone who cannot stand ambiguity should not study telephone traffic. Traffic is a statistical subject by its very nature, and "right" answers are not always meaningful, particularly if we express them to an accuracy of six decimal places. Exactly how many circuits we need, exactly how many calls we can serve, are not meaningful terms. When we try to predict the future on the basis of measurements made in the past, there is always a margin for error. To get any sort of answers at all, we have to make assumptions; our answers, then, can be no better than the way the assumptions work out in any specific real situation. They may, in fact, be a lot worse. But to start off, we must understand the assumptions on which our concepts are based. The rest of this chapter introduces the more important of these assumptions, leaving the following chapters to deal with specific formulas and design principles.

Grade Of Service

The concepts of "Sales" and "Service" are so closely interlinked that we seldom bother to notice that they are quite separate and distinct ways of operating a business. If we think about the matter at all, we almost always think of a business as selling a product which, in our optimism at the moment of purchase, will never break down and need to be serviced. But many businesses are based on selling a service, not a product and, as a result, have to be thought about in a way that tends to be alien to our normal way of considering economic concepts.

The classic example of a service-vending business, as opposed to one that sells a product, is the telephone industry. Until recently, telephone companies sold telephone calls, not telephone equipment, and provided equipment to facilitate calling on a service rather than sales basis; that is, you rented the ability to make phone calls, not a telephone set, PBX, or some other piece of equipment. The telephone company installed the equipment, maintained it, paid taxes and insurance on it and, in many cases, even provided the power to make it operate.

In the public telephone network, the telephone company still provides switches with the ability to connect one customer line to another, or a line to a trunk to another switch; trunks (individual voice circuits shared by customers) are provided between switches to assure the caller of a chance to reach any other telephone in the country. The rules and mechanisms by which the switches select the trunks required to establish any given connection are all part of the package the customer normally never even has to think about.

But the telephone company has to give considerable thought to the matter: how many trunks should be installed between Switch A and Switch B? How many connections at any one time should a switch be capable of supporting? How many signaling detectors are required to return dial tone and accept instructions from people initiating (or modifying) calls?

Clearly, it is always possible to put in lots of capacity to be sure that any call that comes along can be served. But, with too many facilities, many lying idle all the time, a system would cost too much and might not be affordable to its customers. Too few facilities, on the other hand, might be much less expensive, but would block revenue-producing calls and reduce telephone company income. Thus a relatively delicate balance is required: too many facilities reduce customers by increasing cost while too few facilities reduce calling by preventing call completion.

To put this in another form, consider the difference between an automobile dealer and a rent-a-car company. The dealer wants to sell as many cars as he can, so the number of cars he has available is limited only by his floor space and the money required to support his inventory. The rent-a-car company, however, does not sell cars. It rents them. It must have enough cars on hand to meet the variable day-by-day need, but not so many that a large proportion sit idly on the lot, tying up capital and producing no revenue.

There is one additional point to consider in this analogy: if people only rent cars when they need them, a rent-a-car company needs far fewer cars to serve the same number of customers than does a car dealer. For the dealer, one car serves one customer; for the renter, one car serves many. Thus service organizations tend to make more efficient use of some kinds of assets than do companies which sell the same assets as product.

Another point of some interest is the fact that service companies have a very strong motivation to make their serving systems work because they only earn money when delivering the vended service. Companies that sell product, however, are often relatively glad to see a product break down so that they can sell their customer a new item or fix his old one at a surprisingly high cost. Indeed, automobile repair parts cost so much more than the same parts when put into new vehicles that a whole new branch of crime has developed in response to the need: the chop shop. Who says free enterprise is dead?

In any event, we see that too few facilities turn away customers, while too many can increase costs to such a point that customers are again turned away. Thus we have to have some way of defining "enough." The concept of Grade of Service is therefore basic to the thinking of any person or company hoping to render a service for profit.

Grade of Service relates the number of customers to the number of servers. When we are talking about telephone trunks and people making calls, the callers are the customers in question, and the trunks are the servers. We can, equally well, think of servers as being check-out clerks in a supermarket, toll booth personnel on a highway or bridge, or cars in a rent-a-car lot. The Grade of Service tells how well we serve our customers.

The terms used to describe grade of service need to be examined. We usually look at the customers who do not get immediate service; those who are blocked or are delayed in a queue. With luck, this is a small number, and we can say that 5 calls in a hundred are blocked and have to retry, or 20 calls in a hundred are delayed in a queue. You will frequently hear grade of service described with a letter, a "dot," and some numbers, as P.05 (read "P dot oh five"). The "P" means that the Poisson probability distribution and its underlying assumptions, to be examined in Chapter 2, are employed and the dot 05 means 5 calls in 100 find all trunks busy (ATB). Today, it is more common to use the Erlang B distribution, and let "B.05" define our grade of service. Obviously, this means we are going to use the Erlang B distribution and its underlying physical assumptions which give a slightly different meaning to 5 calls in 100 finding ATB. See Chapter 3.

When we get into the subject of queuing calls, or making them wait until a WATS line, for instance, is free to serve them, grade of service must be defined differently. Although we could use the definition that is analogous to the probability of blocking--the probability that all trunks are busy is the probability that a call, originating at random, will go into the queue to wait for a facility to come free--this definition is less useful than defining how long a call has to wait in queue before it is served.

Two standard definitions are used: the average delay on all calls, called D1, including those calls that are not delayed at all (they luck out and arrive when there is no queue and at least one trunk is free), and the delay on only those calls that are delayed, called D2. Obviously, D2 is longer than D1 because it includes no calls with zero waiting time in queue. Another definition that can be used is the number of callers waiting in queue. Analogous to D1 and D2, we have a Q1 and a Q2, where Q1 is the average number of callers waiting, while Q2 is the average number waiting during ATB. But we'll return to this in Chapter 4.

The thing to remember now is that we either define Grade of Service as the proportion of customers that find all servers busy, or else we define it in terms of the average wait for service when a queue can occur. Typically, our dial-9 trunk group out of a PBX to a Central Office should have a grade of service in the B.05 range. B.01 is better than we need, and B.10 may make for problems. When we use expensive facilities such as Foreign Exchange (FX) or WATS lines, we approach things a little differently. We may permit a higher probability of blocking or use a queue.

When we queue calls, we find that D1 less than 30 seconds or greater than about 10 minutes does little to increase the number of calls we can handle on a given group of trunks. With short duration queues, there is a low probability of a circuit coming free during the waiting interval, while with long duration queues, we already have the circuits running full, and we can't pack them fuller, no matter how long we wait.

Randomness

Traffic theory has its roots in probability, and like most probabilistic things, it requires "randomness." In particular, we assume that calls "originate individually and collectively at random" in a "stationary random process." The phrases in quote marks, trotted out at cocktail parties, can have a devastating effect after three martinis, but all they imply is a spontaneity of call origination where callers are not influenced by each other or any common experience.

"Individually and collectively at random" can best be understood by observing what it is not. For instance, if a flying saucer lands on the company parking lot and, as a result, everybody picks up a phone to call security, we do not have calls originating at random. Similarly, the drop in phone rates at 11 PM leads to non-random traffic patterns. Even waiting until midnight for these non-random calls to be cleared implies a non-random origination of our own call.

This arrival of calls at random is one of the most important assumptions in traffic theory, but there is more to it. We still have to deal with the "stationary random process" business. It turns out that our traffic formulas do not hold if traffic is building up as it does between 8 and about 9:30 in the morning in a business, or is falling off as it does at lunch-time or after 4 PM. To be a stationary random process, our call arrivals must fluctuate around a constant average value. That is, there will sometimes be more and sometimes fewer calls in the system, but the average is constant over the study period and not building up or dying out.

For telephone traffic to be able to take advantage of the mathematics of probability and statistics, we assume that calls are placed at random--that is, every second during a study period is just as likely as every other second to see one or more calls originate. The probability that one or more calls end in a given second is proportional to the number of calls in progress and the average duration of a call, and is thus also pretty much random. Statistical fluctuations around a constant average describe the behavior of theoretical telephone traffic.

Holding Time

Additional assumptions concern "holding time." Holding time is not, as some people believe, the time they are left on hold when they call a busy company. Rather, it is the total time a server is occupied with a given call. Paid holding time or conversation time is something else again. It is the time your pay for when you make a call, excluding dialing, set-up and ringing, sometimes known as operating time. Note that we can occupy a coast-to-coast telephone company (but not a specialized carrier) circuit for an hour and, if nobody answers the call, we don't pay for it. Thus the conversation or paid holding time is zero, although the telephone company's traffic people will find their traffic recording equipment showing an hour of use.

In any event, holding time is the time we occupy a trunk or some other facility on a given call. And there has been a great deal of study of how holding times behave, statistically. With ordinary voice calls, we can measure the total use and the "peg count" or number of calls made, divide, and get an average holding time. But some calls are longer than others. Typically, we find a relatively large number of short calls and a smaller number of longer calls; the longer the call, the smaller the probability that it will be present.

The mathematicians calls such a distribution "exponential" or, properly, "negative exponential." A negative exponential distribution of holding times says that there will be an ever decreasing probability that a given call will last as long as or longer than an ever increasing duration. Measurements actually made on telephone calls over the years show that regular calls are, indeed, distributed this way.

We can thus calculate the probability of a call lasting some number of average holding times or longer by using standard tables, or the exponential key on a hand held calculator. But it is more fun to use a computer. Fire up your computer, put the disk in the slot, and run EXPO. EXPO will immediately give you a table of probabilities that a call will last longer than several selected multiples of one average holding time. See Printout 1-1 in Appendix IV.

We note immediately that any given call, if it exists at all, will have a probability of 1.0 (100%) of being 0 holding times or longer in duration, which seems sort of obvious. But is it as obvious that a call has a 61% probability of lasting longer than half a holding time, a probability of 0.37 of lasting longer than one holding time, but only a probability of about 5% (.05) of being longer than three holding times?

The program will now ask you how many holding times you would like to investigate. Give it any number you like, up to 14, integer or fraction. It will immediately respond with the probability that a call will last that long or longer.

The ability to measure things in terms of holding times is very important in studying traffic. However, we ultimately want to know the probability that a call will last longer than 15 minutes, or something definite like that. EXPO can help us. It should be sitting there now, asking you how many holding times you want. Give it a number bigger than 600, and it will break out of the loop to ask you for call holding time in minutes. Not minutes and seconds, but decimal minutes, such as 5.6 instead of 5 minutes and 36 seconds. Give it a "real" average holding time, and you can now get the probability that a call will last as long or longer than any number of minutes you punch in. Give it different numbers of minutes, one at a time, and see how it responds. Follow instructions on the screen to terminate the run when you are finished.

The exponential nature of holding times has a number of interesting consequences, two of which we will examine now, without proof. First of all, let us suppose that we find a call in progress. What is the probability that it will continue for one holding time? It turns out that, no matter how long the call has been up, the probability that it will continue for a given number of holding times is the same as if the call had just started. To put it another way, the continued duration of any call is independent of its past duration. Odd but true.

As a direct result of the above, but again we will not make any attempt to prove it (an exercise for the student?), the average delay the first call in a queue will encounter when it arrives and finds ATB is the average holding time on that trunk group divided by the number of trunks. For instance, suppose we have 5 trunks to Chicago handling 10-minute calls. When all trunks are found to be busy, the average wait for one (any one) to come free is 5/10, or half a minute. If we have 7-minute calls on a trunk group of 2, the average wait for one to come free is 3.5 minutes. If we hold calls in a 30 second queue before we let them overflow to some other trunk group, the odds are pretty good that the 5 trunk group will, in a specific instance, have a trunk come free before the queue times out, but the odds are much less for the two-trunk group.

Exponential holding times have been studied extensively, and the exponential model appears to be quite good for voice calls. However, there are many types of calls or operations that are of relatively fixed duration. The time required to dial a telephone number, for instance, makes the holding time of a Touch-Tone detector very nearly constant. There are a great many other traffic engineered operations that have a nearly fixed duration: the time it takes a directory assistance operator to help a customer, for instance, or the time it takes a toll-booth attendant on a bridge or highway to take a quarter. Thus the other holding time distribution that has been given attention by the mathematicians is the fixed holding time. The difference between fixed and exponential holding times comes out mostly in the study of queuing; it turns out that, if calls have a fixed length rather than one that is exponentially distributed, the delay in a queue is, in general, somewhat less. In data communication, packet switching, with packets of fixed length, takes advantage of this.

The thing to remember here is that, for voice calls, we will usually assume exponential holding times, while for many types of data engineering as well as the engineering of some kinds of voice switching equipment, constant holding times are more appropriate.

Traffic Units

Most of the troubles encountered by beginning students of traffic come from the fact that the ancient Babylonians had 30 fingers on each hand. At least it appears that they counted by 60s, what with 360 degrees in a circle, 360 days in a year, 60 seconds in a minute, and 60 minutes in an hour. Counting by 60s instead of 10s is a real bother.

When we measure traffic, we use some form of timer, and we measure the total time a server or group of servers is occupied during a study period. Thus we measure our traffic in seconds, minutes, hours, or some combination thereof. The problems come when we try to convert from 2.75 hours to minutes, or from 3.68 minutes to seconds. Now, 2.75 hours is 2 hours and 45 minutes, so the total number of minutes comes out to be 165. The number of seconds in 3.68 minutes is left as an exercise for the student.

In the early days, the main aspect of traffic dealt with staffing switchboards. Thus time-and-motion studies were conducted by efficiency experts clutching stop watches and measuring all the different operations performed by a telephone operator. Most such actions, such as plugging a cord into a jack, or looking up a name in a directory, were completed very quickly, so measurements in terms of seconds were quite satisfactory. When call durations were measured, however, seconds were too short. Thus the intrepid traffic engineers decided to simplify their lives by changing the size of the minute. They created a minute that is 100 second long, and called it a CCS for Hundred Call Seconds (well, C stands for 100 in Roman numerals). This way they could deal with switchboards and trunks in the same units, just by moving a decimal point around. Nice, simple and clean. The telephone industry in North America uses the CCS to this day.

When customers buy telephone calls, however, they do not understand the CCS unit. Thus the telephone company and most of the specialized common carriers print out telephone bills in terms of whole minutes. To keep things simple (and to make more money), each call is usually rounded up to the next whole minute so that fractional minutes need not be considered. Thus, if you go one second into the next minute, you pay for the whole thing (the exceptions are quite rare, and will be flagged when they occur). Be this as it may, minutes are used to measure traffic when we are concerned with paying for it.

There is only one unit left. Hours. And that is the most important one of all. Hours of use per month is much more tractable than minutes or CCS, and WATS and the specialized services are billed by the hour instead of by the minute. But there are other reasons why hours are the best units to use. We will investigate them right after we look at the concept of the "busy hour."

The Busy Hour

It was stated above that, for our traffic assumptions to work, our traffic must be in statistical equilibrium; that is, it fluctuates, but around a constant average value. The average itself does not change during the study period. This suggests there are limits on how long a study period can be.

If it is too long, we know there will be variations that are not statistical but causal: we know that nobody is at work before 8 AM, that traffic builds up during the business day but falls off at lunch time, and then builds up for the afternoon until the five o'clock whistle kills it, as suggested by Fig. 1-1. Thus a day is too long a study period.

Fig. 1-1. Typical business day telephone traffic.

On the other hand, a one-minute interval, for ordinary telephone voice traffic, is much too short. On a minute by minute basis, the number of calls on a given group of trunks can be seen to fluctuate all over the place, and any one minute is hardly representative of anything.

So what do we do? We choose a study period that's not too long and not too short, but just right. An hour turns out to be just about what we want. It is long enough to give stable averages, and short enough to miss the major daily causal fluctuations. Specifically, it is longer than the average holding time per call by about a factor of 10, but it is shorter than the main work intervals in the morning and afternoon. Thus an hour is just about what we want.

But which hour? Where should an hour start? For convenience, we make measurements that often start at the hour. This permits sorting collected data on just the hour. However, in the best of all possible worlds, we would like to have two collection schemes in progress, one starting and ending on the hour, and the second starting and ending on the half-hour. These overlapping hourly intervals would help us find more accurately the actual busiest period during the day, but as yet most PBXs and stand-alone data collection systems do not have such flexibility.

Why not measure in terms of half hours? Well, a shorter period doesn't give as big a sample, and in any statistical operation, the larger the sample, the more confidence we can have in the averages and other values we measure and the extreme values we choose to estimate. Second, we want the study period to be as much longer than the average call holding time as possible to minimize the impact of the small number of very long calls that may occur. Finally, "hours of use per hour" is a very nice unit. It does not need any sort of multiplying factors such as the 36 per hour with CCS or 60 per hour with minutes, and it translates directly into many other useful concepts. An hour of use per hour has a name: it is called an Erlang, after the Danish mathematician and telephone engineer, A. K. Erlang, who, back just after the turn of the century, developed many of our standard traffic formulas.

One other point. We look for the busy hour because, if we can handle our traffic then with a satisfactory grade of service, the "side hour" traffic can take care of itself. One of the most common approximations in business communication is the rule that the busy hour is one sixth of a day's traffic, or 6 busy hours equal one day, as suggested by Fig. 1-2. Expressed as a percentage, this comes out to 16% or 17%, depending on how you like to round numbers off. One-sixth is actually 0.1666666666... as far as you want to go. But note that this rule of thumb is just that, an educated guess. It is usually better to use actual measurements rather than estimates, if you have them available. But when you have only daily traffic, dividing by 6 is a better approximation than most.

Fig. 1-2. Suggesting six busy hours = one day.

At this point, a word of warning is in order. Much traffic data is collected and sorted in ways that do not reveal the true busy hour. For instance, call detail recording systems, recording bill-back information, are sometimes sorted monthly to obtain data for traffic engineering. All calls originating between 7 and 8, 8 and 9, 9 and 10, etc., are totaled in similar registers, without regard to the days on which they occurred. If these totals are divided by the number of business days per month, a daily pattern is obtained; however, in such a pattern, the "busy" hour is obviously an average busy hour, and may fall far short of the true busy hour. Where causal factors make, say, the first week in the month much heavier in telephone use than the remaining three, this can lead to disaster.

There is another approximation relating to busy hour traffic that is often employed: 22 business days equal one month. When dealing with a general month in network design, this is a good approximation, leading to the "magic number," 132. Obviously, 22*6=132, and suggests that the busy hour can be approximated from monthly traffic by dividing monthly hours of use by the magic number. However, for any particular month, particularly when used as a source of data, care must be taken. The exact period involved must be known (carriers seldom bill by the calendar month), and the actual number of days worked during that period must be counted.

Fig. 1-3 plots over the period of a year the actual traffic from months of varying lengths (dotted curve), and what that traffic would have been if all months had been 22 days long (solid curve). The difference is seen to be quite striking. To get the smoothed (solid) curve, one divides by the actual number of days worked to get the average traffic per day, and then multiplies by 22.

Fig. 1-3. Actual monthly traffic vs. traffic from 22-day months.

There are some interesting variations between busy hour and side hour traffic, which we will return to in greater detail later. For now, we simply want to note that first choice trunks get a larger proportion of the available traffic in the side hours than in the busy hours. This can be understood intuitively by noting that a lone overtime worker, when making telephone calls, will always get the WATS line and will never overflow to toll. During the day, however, when much more traffic is present, the WATS line or any other facility cannot handle more than one call at a time, and more than one hour of use per hour (one Erlang). This translates to 60 minutes or 36 CCS per hour. That is tops. So a bigger proportion of the offered traffic will go on to the next choice circuits.

The Erlang

So now we can return to the Erlang as a traffic unit. What does it mean and how can we use it? Well, for starters, let's assume we have exactly one trunk and every hour we measure the traffic it carries. Suppose we find it busy for 45 minutes. This would be 2700 seconds or 27 CCS, but it is also three quarters of an hour, or 0.75 Erlang. Which number, 45, 27 or 0.75 is the most useful? I vote for 0.75, because this number tells us the occupancy of the trunk during the hour in question. We know it is running at 75% occupancy, and thus is unused for 25% of the hour.

Now, suppose we have a group of 5 trunks, and our measurements say we are carrying 3 Erlangs of traffic. The average occupancy per trunk is 3 divided by 5, 0.6 or 60%. Now 3 Erlangs is 180 minutes or 108 CCS. But knowing we have 108 CCS on 5 trunks, we cannot find percent occupancy as easily; we first have to divide by 36. This illustrates our first major advantage of Erlangs over CCS or minutes of use per hour. It lets us estimate occupancy of our facilities easily, simplifying management by telling us how effectively we are using our resources.

But we get even more. If we are carrying 3 Erlangs on a group of trunks, we know that, on the average, 3 calls will be in the system. This is what the Erlang means...the average number of calls in progress at any one time. If we measure 14.73 erlangs on a big trunk group, we know that sometimes we had more and sometimes we had fewer calls in progress, but over the hour, the average number of calls in progress is 14.73. So the average occupancy during the hour and the average number of calls in progress is seen to be the same thing, as long as we measure in hours of use per hour, or Erlangs.

Consider an example. The Dimension 400 PBX has a switching matrix (the equipment that connects extensions to each other and to trunks) that can handle 1700 CCS in a busy hour. What does this mean? We can find out if we divide 1700 CCS by 36 to get 47.22 Erlangs, or about 47 calls present, on the average. Bell uses the Poisson tables (to be discussed in Chapter 2), and if we look up 47 Erlangs, we find that 64 servers will provide a grade of service of about P.01. Looking further at the Dimension, we discover that it has 64 time slots and can, as a result, handle 64 simultaneous calls, maximum. It seems to me that 47 calls on the average relates more reasonably to 64 time slots than does 1700 CCS.

In most traffic formulas and tables, traffic is represented by the letter A, either upper or lower case. Why A? Why not E for Erlang? Well, the name Erlang was attached to the traffic unit quite late in the game, and A has another meaning. It represents the Arrival rate, or the rate at which our calls, individually and collectively at random, appear at the trunk group to demand service. So what does 14.73 mean as an arrival rate?

We cannot tell without one more piece of information. We need to know the average length of a call, so that we can tell how many calls per hour 14.73 Erlangs implies. Assume we have 5-minute holding times, on the average. We then convert 14.73 Erlangs to minutes by multiplying by 60, and divide by 5 to find the number of calls involved. When we do the arithmetic, we get 176.76 calls. This is the average number of calls arriving per hour. To find the number of calls per minute, we now divide by 60, getting 2.95. This is the average arrival rate: 2.95 calls per minute arrive at our trunk group expecting service.

We could have gotten this number a lot easier by not first multiplying and then dividing by 60. That is, we could have divided 14.73 Erlangs by 5 minutes per call to get 2.95 calls per minute. This suggests that 14.73 is the average arrival rate during one holding time. Further, no matter what units the holding time is measured in, we can find the arrival rate during one of those units by simply dividing occupancy in Erlangs by the number of units per call.

Example: If the average holding time per call is 243 seconds and we still have our 14.73 Erlangs of use, the arrival rate is 14.73/243=0.0606 calls per second. In other words, the probability of a call arriving in any given second is 0.0606. We will make considerable use of this sort of thing later.

If calls arrive at the rate of 0.0606 per second, the reciprocal of the arrival rate is the average time between call arrivals; here, 1/0.606=16.5 seconds. Mean time between call arrivals is more used in data than voice engineering, but the relationship is useful to remember. One more thing: we have been calculating arrival rate on the basis of the traffic carried by our trunk group. Carried traffic is the only thing we can measure, but it differs from offered traffic in that sometimes all trunks will be busy and not all the offered traffic will get through. Thus some traffic will be lost. So what we have been figuring is the arrival rate of successful calls; what we really want is the arrival rate of all calls, including those that are lost. Further, the A that occurs in traffic formulas is for offered traffic, not carried traffic. There is a difference, and we will come back to it several times before we're done. Note that, if we have a good grade of service, the difference in offered and carried traffic is quite small (like 5% for a grade of service of B.05).

Just as arrival rate and mean time between call arrivals are reciprocals, so holding time and service rate are reciprocals. If we have five-minute calls, each server is serving them at the rate of 12 per hour, five minutes being one twelfth of an hour. The rate at which we service calls is the rate at which they depart the system. Thus at any instant, the departure rate is the number of calls in progress multiplied by he service rate, or, what is the same thing, the number of calls in progress divided by the average holding time per call.

Summary

Let's remember what "traffic engineering" is supposed to help us do: it is supposed to help us predict the future. We are using measurements made in the past and inserting them into mathematical models to predict future behavior of the system in question.

For example, suppose we have a "combination" trunk group of 17 trunks to take dial-9 traffic from our PBX to the central office, and to bring incoming calls to our console attendant from the CO. Everybody is complaining that they can't get out, and outside callers say they can't get in. We make a measurement. We find the traffic carried during the busy hour is about 14 Erlangs, or hours of use during that particular hour. The Erlang B mathematical model, to be discussed in Chapter 3 and summarized in Table I of Appendix II, indicates that 14 Erlangs on 17 trunks is pushing a B.20 grade of service; almost one call in 5 is hitting ATB or an All Trunks Busy condition. This is not good. But what do we do about it? We decide that a grade of service in the B.05 range will be about right. How many trunks must we add?

Well, Table I, at Grade of Service = B.20, tells us that if we are carrying 14 Erlangs on 17 trunks, the offered traffic had to be about 18 Erlangs. Now, it is obvious that some of this traffic is going to be retries, because so many people are not able to get out on the first attempt. However, for the moment, we will pretend that there are no retries. We will just assume that 18 Erlangs of traffic is offered, and see how many trunks are required for a B.05 grade of service. The B.05 section of Table I tells us that 23 trunks will be needed. So we add six, and everybody is happy.

Now we can refine matters a bit. Table I tells us that when we offered 18 hours to our original 17 trunks, the LOST traffic was about 3.6 hours. Considerable study has been given to retries, notably by James Jewett and others; Jewett's research indicates that about 55% of the lost traffic retries. If we take 55% of 3.6 hours, we get almost 2 hours of retry traffic. We pull this out of the 18 hours, and get what Jewett calls "First Attempt Traffic." That is, the offered traffic is actually about 16 hours if we pull out the retries. If we offer 16 Erlangs to 21 trunks, Table I predicts a grade of service fairly close to B.05. So we order 4 trunks rather than 6, save some money, and hope that things will improve when the changes are effected.

Let's recap what we have done. We have measured the carried traffic over what we hope is a busy hour; the carried traffic during the hour (divided by 1 hour) is the average traffic per hour. The study of statistics tells us that averages such as the mean value are very stable and are good estimates of what is going on. They are also easy to collect, and are thus relatively inexpensive. So we measure averages, and use them to estimate maximum values. Here we use the average number of calls in the system to estimate how many trunks are required so that only about 5% of the time will calls arriving at random find all trunks busy.

There are other measurements that we can make, and some of them are less expensive than average values. One of the most common is either a measure of ATB, or a count of the number of calls that overflow our trunk group. If we knew the duration of the ATB condition (measured in hours), we would have a direct measure of grade of service using Erlang B as our mathematical model. Or, if we knew the total number of offered calls, and divided it into the number of overflow calls, once again we'd have a measure of grade of service. But even if we did this, the results would not be too satisfactory. Why? Because we are dealing with extreme values which, under the same random conditions, will vary considerably from one hour to the next. Thus they are not as stable as the mean value, and they will not give us as reliable estimates of what we need to know.

To say it one more time, we measure mean or average values, and use them to predict the probability of occurrence of extreme values; then we size our systems accordingly. Grade of Service relates Offered or Carried traffic to Number of servers required. That's all there is to it...but, as we will see in subsequent chapters, it is quite enough.

[ Top ] [ Next Chapter ] [ Table of Contents ]


Copyright 2006 Lee Goeller. All Rights Reserved.