
===========================================================================
STATISTICS FOR "DDSEARCH"."TXT_INX_JOBS_SEARCH"
===========================================================================

indexed documents: 1,997
allocated docids: 2,003
$I rows: 136,345

---------------------------------------------------------------------------
TOKEN STATISTICS
---------------------------------------------------------------------------

unique tokens: 25,343
average $I rows per token: 5.38
tokens with most $I rows:
N (0:TEXT) 181
2012 (0:TEXT) 181
OF (0:TEXT) 177
AND (0:TEXT) 176
THE (0:TEXT) 175
IS (0:TEXT) 174
TO (0:TEXT) 173
PATIENT (0:TEXT) 172
IN (0:TEXT) 172
A (0:TEXT) 172
WITH (0:TEXT) 170
ON (0:TEXT) 166
M.D (0:TEXT) 165
NO (0:TEXT) 164
WAS (0:TEXT) 163
HAS (0:TEXT) 162
FOR (0:TEXT) 162
AT (0:TEXT) 160
AS (0:TEXT) 159
OR (0:TEXT) 157
NOT (0:TEXT) 157
THIS (0:TEXT) 156
10 (0:TEXT) 156
HAVE (0:TEXT) 151
12 (0:TEXT) 151
ANY (0:TEXT) 149
HISTORY (0:TEXT) 148
WHICH (0:TEXT) 145
HAD (0:TEXT) 145
HER (0:TEXT) 143
BE (0:TEXT) 143
TIME (0:TEXT) 142
BEEN (0:TEXT) 141
WILL (0:TEXT) 140
IT (0:TEXT) 140
BUT (0:TEXT) 140
THAT (0:TEXT) 139
ARE (0:TEXT) 139
SHE (0:TEXT) 138
HE (0:TEXT) 137
DATE (0:TEXT) 136
I (0:TEXT) 134
DR (0:TEXT) 134
WELL (0:TEXT) 132
LEFT (0:TEXT) 132
THERE (0:TEXT) 131
ALSO (0:TEXT) 131
PAIN (0:TEXT) 129
S (0:TEXT) 128
NORMAL (0:TEXT) 128
HIS (0:TEXT) 128
BY (0:TEXT) 128
2 (0:TEXT) 128
UP (0:TEXT) 127
PAGE (0:TEXT) 127
MEDICAL (0:TEXT) 127
FROM (0:TEXT) 127
SOME (0:TEXT) 126
AN (0:TEXT) 126
RIGHT (0:TEXT) 125
IF (0:TEXT) 125
DOES (0:TEXT) 125
TWO (0:TEXT) 123
MEDICATIONS (0:TEXT) 122
WHO (0:TEXT) 121
FAMILY (0:TEXT) 121
BACK (0:TEXT) 120
WE (0:TEXT) 119
PAST (0:TEXT) 119
ONE (0:TEXT) 119
DID (0:TEXT) 118
PHYSICAL (0:TEXT) 117
LAST (0:TEXT) 115
EXAMINATION (0:TEXT) 115
WERE (0:TEXT) 114
THREE (0:TEXT) 114
SEE (0:TEXT) 114
1 (0:TEXT) 113
PRESSURE (0:TEXT) 110
BLOOD (0:TEXT) 110
CONTINUE (0:TEXT) 109
SYMPTOMS (0:TEXT) 108
MONTHS (0:TEXT) 108
PLAN (0:TEXT) 107
MILD (0:TEXT) 107
HIM (0:TEXT) 107
SIGNIFICANT (0:TEXT) 106
PRESENT (0:TEXT) 106
ABOUT (0:TEXT) 106
VERY (0:TEXT) 105
DISEASE (0:TEXT) 105
5 (0:TEXT) 105
IMPRESSION (0:TEXT) 104
AFTER (0:TEXT) 104
WHEN (0:TEXT) 103
TODAY (0:TEXT) 103
ME (0:TEXT) 103
20 (0:TEXT) 103
WEEKS (0:TEXT) 102
DO (0:TEXT) 102

average size per token: 79
tokens with largest size:
THE (0:TEXT) 28,480
AND (0:TEXT) 22,461
OF (0:TEXT) 21,662
IS (0:TEXT) 18,312
TO (0:TEXT) 17,567
M.D (0:TEXT) 16,479
IN (0:TEXT) 14,044
A (0:TEXT) 13,482
SHE (0:TEXT) 12,611
WITH (0:TEXT) 12,598
PATIENT (0:TEXT) 12,179
FOR (0:TEXT) 10,619
ON (0:TEXT) 9,990
2012 (0:TEXT) 9,899
NO (0:TEXT) 9,799
HAS (0:TEXT) 9,733
HE (0:TEXT) 9,636
S (0:TEXT) 9,229
NOT (0:TEXT) 8,424
WAS (0:TEXT) 8,413
I (0:TEXT) 8,269
HER (0:TEXT) 8,162
AT (0:TEXT) 8,029
N (0:TEXT) 7,859
WILL (0:TEXT) 7,615
OR (0:TEXT) 7,557
AS (0:TEXT) 7,250
THAT (0:TEXT) 7,176
THIS (0:TEXT) 7,144
RIGHT (0:TEXT) 6,533
PAGE (0:TEXT) 6,480
LEFT (0:TEXT) 6,286
HISTORY (0:TEXT) 6,040
HAVE (0:TEXT) 5,920
BUT (0:TEXT) 5,725
COUNTY (0:TEXT) 5,531
PHYSICAL (0:TEXT) 5,513
HAND (0:TEXT) 5,432
HIS (0:TEXT) 5,346
PAIN (0:TEXT) 5,244
EXAMINATION (0:TEXT) 5,239
ARE (0:TEXT) 5,142
DATE (0:TEXT) 5,071
PLAN (0:TEXT) 4,935
BE (0:TEXT) 4,874
WELL (0:TEXT) 4,718
ANY (0:TEXT) 4,662
2 (0:TEXT) 4,618
10 (0:TEXT) 4,560
NORMAL (0:TEXT) 4,439
TODAY (0:TEXT) 4,369
THERE (0:TEXT) 4,357
CENTER (0:TEXT) 4,339
WE (0:TEXT) 4,276
HAD (0:TEXT) 4,155
IT (0:TEXT) 4,077
12 (0:TEXT) 4,045
DOES (0:TEXT) 4,043
FAX (0:TEXT) 4,010
RE (0:TEXT) 3,823
P.C (0:TEXT) 3,799
PHILADELPHIA (0:TEXT) 3,782
DR (0:TEXT) 3,775
MD (0:TEXT) 3,772
JERSEY (0:TEXT) 3,771
WHICH (0:TEXT) 3,758
SOME (0:TEXT) 3,696
BEEN (0:TEXT) 3,690
IMPRESSION (0:TEXT) 3,593
NEW (0:TEXT) 3,577
ALSO (0:TEXT) 3,577
EXAM (0:TEXT) 3,542
AN (0:TEXT) 3,487
MAIN (0:TEXT) 3,474
PHONE (0:TEXT) 3,416
DOB (0:TEXT) 3,414
JOHN (0:TEXT) 3,409
M (0:TEXT) 3,400
M.D., (0:TEXT) 3,344
BACK (0:TEXT) 3,333
WERE (0:TEXT) 3,331
TIME (0:TEXT) 3,305
UP (0:TEXT) 3,274
FROM (0:TEXT) 3,234
FOLLOW-UP (0:TEXT) 3,216
IF (0:TEXT) 3,196
CHERRY (0:TEXT) 3,194
MEDICATIONS (0:TEXT) 3,192
SOUTH (0:TEXT) 3,138
SEE (0:TEXT) 3,134
LINE (0:TEXT) 3,121
LAST (0:TEXT) 3,094
READ (0:TEXT) 3,053
CHESTER (0:TEXT) 3,039
DELAWARE (0:TEXT) 3,038
HILL (0:TEXT) 3,035
LANDING (0:TEXT) 3,031
PENNSYLVANIA (0:TEXT) 3,025
MONTGOMERY (0:TEXT) 3,025
BUCKS (0:TEXT) 3,019

average frequency per token: 18.03
most frequent tokens:
N (0:TEXT) 1,979
2012 (0:TEXT) 1,958
OF (0:TEXT) 1,946
THE (0:TEXT) 1,939
PATIENT (0:TEXT) 1,927
AND (0:TEXT) 1,917
IS (0:TEXT) 1,900
TO (0:TEXT) 1,869
IN (0:TEXT) 1,834
M.D (0:TEXT) 1,802
A (0:TEXT) 1,790
FOR (0:TEXT) 1,670
WITH (0:TEXT) 1,653
DATE (0:TEXT) 1,617
PAGE (0:TEXT) 1,598
NOT (0:TEXT) 1,566
ON (0:TEXT) 1,514
WILL (0:TEXT) 1,506
NO (0:TEXT) 1,506
I (0:TEXT) 1,473
HAS (0:TEXT) 1,442
AT (0:TEXT) 1,331
PHYSICAL (0:TEXT) 1,326
S (0:TEXT) 1,317
WAS (0:TEXT) 1,306
THIS (0:TEXT) 1,292
EXAMINATION (0:TEXT) 1,288
OR (0:TEXT) 1,253
BUT (0:TEXT) 1,239
AS (0:TEXT) 1,222
FAX (0:TEXT) 1,204
PLAN (0:TEXT) 1,200
10 (0:TEXT) 1,174
12 (0:TEXT) 1,152
THAT (0:TEXT) 1,146
DOB (0:TEXT) 1,117
HAVE (0:TEXT) 1,100
TODAY (0:TEXT) 1,084
HISTORY (0:TEXT) 1,070
ARE (0:TEXT) 1,062
2 (0:TEXT) 1,012
WELL (0:TEXT) 1,009
PHONE (0:TEXT) 1,006
M.D., (0:TEXT) 997
BE (0:TEXT) 986
SHE (0:TEXT) 982
PAIN (0:TEXT) 956
DR (0:TEXT) 949
HER (0:TEXT) 938
RE (0:TEXT) 933
IMPRESSION (0:TEXT) 926
RIGHT (0:TEXT) 923
LEFT (0:TEXT) 907
THERE (0:TEXT) 905
ANY (0:TEXT) 903
SUITE (0:TEXT) 898
MAIN (0:TEXT) 896
HE (0:TEXT) 884
NAME (0:TEXT) 879
CENTER (0:TEXT) 862
NEW (0:TEXT) 858
WE (0:TEXT) 850
HAD (0:TEXT) 849
M (0:TEXT) 847
MD (0:TEXT) 836
BEEN (0:TEXT) 829
HAND (0:TEXT) 828
WHICH (0:TEXT) 827
FOLLOW-UP (0:TEXT) 827
DOES (0:TEXT) 816
K (0:TEXT) 815
COUNTY (0:TEXT) 810
EXAM (0:TEXT) 801
IT (0:TEXT) 799
JOHN (0:TEXT) 798
C (0:TEXT) 797
W (0:TEXT) 795
SOUTH (0:TEXT) 792
UP (0:TEXT) 789
MEDICATIONS (0:TEXT) 787
H (0:TEXT) 786
MRN (0:TEXT) 784
DAVID (0:TEXT) 781
LINE (0:TEXT) 777
NORMAL (0:TEXT) 776
SEND (0:TEXT) 775
LAST (0:TEXT) 775
MARK (0:TEXT) 770
READ (0:TEXT) 764
P.C (0:TEXT) 764
PH.D (0:TEXT) 763
LEE (0:TEXT) 760
STEPHANIE (0:TEXT) 758
NEAL (0:TEXT) 757
RANDALL (0:TEXT) 756
HILL (0:TEXT) 756
CHERRY (0:TEXT) 756
PETER (0:TEXT) 755
LANDING (0:TEXT) 755
CHESTER (0:TEXT) 755

token statistics by type:

token type: 0:TEXT
unique tokens: 25,343
total rows: 136,345
average rows: 5.38
total size: 2,011,745 (1.92 MB)
average size: 79
average frequency: 18.03
most frequent tokens:
N 1,979
2012 1,958
OF 1,946
THE 1,939
PATIENT 1,927
AND 1,917
IS 1,900
TO 1,869
IN 1,834
M.D 1,802
A 1,790
FOR 1,670
WITH 1,653
DATE 1,617
PAGE 1,598
NOT 1,566
ON 1,514
WILL 1,506
NO 1,506
I 1,473
HAS 1,442
AT 1,331
PHYSICAL 1,326
S 1,317
WAS 1,306
THIS 1,292
EXAMINATION 1,288
OR 1,253
BUT 1,239
AS 1,222
FAX 1,204
PLAN 1,200
10 1,174
12 1,152
THAT 1,146
DOB 1,117
HAVE 1,100
TODAY 1,084
HISTORY 1,070
ARE 1,062
2 1,012
WELL 1,009
PHONE 1,006
M.D., 997
BE 986
SHE 982
PAIN 956
DR 949
HER 938
RE 933
IMPRESSION 926
RIGHT 923
LEFT 907
THERE 905
ANY 903
SUITE 898
MAIN 896
HE 884
NAME 879
CENTER 862
NEW 858
WE 850
HAD 849
M 847
MD 836
BEEN 829
HAND 828
WHICH 827
FOLLOW-UP 827
DOES 816
K 815
COUNTY 810
EXAM 801
IT 799
JOHN 798
C 797
W 795
SOUTH 792
UP 789
MEDICATIONS 787
H 786
MRN 784
DAVID 781
LINE 777
NORMAL 776
SEND 775
LAST 775
MARK 770
READ 764
P.C 764
PH.D 763
LEE 760
STEPHANIE 758
NEAL 757
RANDALL 756
HILL 756
CHERRY 756
PETER 755
LANDING 755
CHESTER 755

---------------------------------------------------------------------------
FRAGMENTATION STATISTICS
---------------------------------------------------------------------------

total size of $I data: 2,011,745 (1.92 MB)

$I rows: 136,345
estimated $I rows if optimal: 25,452
estimated row fragmentation: 81 %
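
The row fragmentation figure above appears to follow directly from the two row counts reported with it. The following is a minimal sketch of that arithmetic, assuming fragmentation is the share of current $I rows that would no longer exist at the estimated optimal row count. The function name `row_fragmentation` is hypothetical, and the formula is an assumption that happens to reproduce the reported value, not a documented definition.

```python
def row_fragmentation(actual_rows: int, optimal_rows: int) -> float:
    """Percentage of $I rows in excess of the estimated optimal row count.

    Assumed formula (not taken from the report): 100 * (1 - optimal / actual).
    """
    if actual_rows <= 0:
        raise ValueError("actual_rows must be positive")
    return 100.0 * (1.0 - optimal_rows / actual_rows)


# Values copied from the FRAGMENTATION STATISTICS section of the report.
frag = row_fragmentation(actual_rows=136_345, optimal_rows=25_452)
print(f"{frag:.1f} %")  # prints "81.3 %", consistent with the reported 81 %
```

Under the same assumption, the estimated optimal row count (25,452) sits just above the number of unique tokens (25,343), which suggests that nearly every token could fit in a single $I row after optimization.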

garbage docids: 6
estimated garbage size: 5,001 (4.88 KB)

most fragmented tokens:
FOLLOW-UP (0:TEXT) 99 %
FOLLOW (0:TEXT) 99 %
FIVE (0:TEXT) 99 %
FIRST (0:TEXT) 99 %
FEMALE, (0:TEXT) 99 %
FEELS (0:TEXT) 99 %
FEELING (0:TEXT) 99 %
FEEL (0:TEXT) 99 %
FAMILY (0:TEXT) 99 %
EXTREMITY (0:TEXT) 99 %
EXTREMITIES (0:TEXT) 99 %
EXAM (0:TEXT) 99 %
EVIDENCE (0:TEXT) 99 %
EVALUATION (0:TEXT) 99 %
EDEMA (0:TEXT) 99 %
DURING (0:TEXT) 99 %
DUE (0:TEXT) 99 %
DR (0:TEXT) 99 %
DOWN (0:TEXT) 99 %
DONE (0:TEXT) 99 %
DOING (0:TEXT) 99 %
DOB (0:TEXT) 99 %
DO (0:TEXT) 99 %
DISEASE (0:TEXT) 99 %
DISCUSSED (0:TEXT) 99 %
DID (0:TEXT) 99 %
DIABETES (0:TEXT) 99 %
DENIES (0:TEXT) 99 %
DECREASED (0:TEXT) 99 %
DEAR (0:TEXT) 99 %
DAYS (0:TEXT) 99 %
DAY (0:TEXT) 99 %
DATE (0:TEXT) 99 %
CURRENTLY (0:TEXT) 99 %
CURRENT (0:TEXT) 99 %
COULD (0:TEXT) 99 %
CONTINUE (0:TEXT) 99 %
COMPLAINTS (0:TEXT) 99 %
CLEAR (0:TEXT) 99 %
CHRONIC (0:TEXT) 99 %
CHEST (0:TEXT) 99 %
CHANGES (0:TEXT) 99 %
CHANGE (0:TEXT) 99 %
CC (0:TEXT) 99 %
CARE (0:TEXT) 99 %
CARDIAC (0:TEXT) 99 %
CAN (0:TEXT) 99 %
BY (0:TEXT) 99 %
BUT (0:TEXT) 99 %
BOTH (0:TEXT) 99 %
BLOOD (0:TEXT) 99 %
BILATERALLY (0:TEXT) 99 %
BILATERAL (0:TEXT) 99 %
BETTER (0:TEXT) 99 %
BEING (0:TEXT) 99 %
BEEN (0:TEXT) 99 %
BECAUSE (0:TEXT) 99 %
BE (0:TEXT) 99 %
BACK (0:TEXT) 99 %
AS (0:TEXT) 99 %
AROUND (0:TEXT) 99 %
AREA (0:TEXT) 99 %
ARE (0:TEXT) 99 %
ANY (0:TEXT) 99 %
AN (0:TEXT) 99 %
ALSO (0:TEXT) 99 %
ALLERGIES (0:TEXT) 99 %
ALL (0:TEXT) 99 %
ALCOHOL (0:TEXT) 99 %
AGO (0:TEXT) 99 %
AGAIN (0:TEXT) 99 %
AFTER (0:TEXT) 99 %
ACUTE (0:TEXT) 99 %
ABOVE (0:TEXT) 99 %
ABOUT (0:TEXT) 99 %
ABLE (0:TEXT) 99 %
ABDOMEN (0:TEXT) 99 %
80 (0:TEXT) 99 %
7 (0:TEXT) 99 %
60 (0:TEXT) 99 %
6 (0:TEXT) 99 %
50 (0:TEXT) 99 %
5 (0:TEXT) 99 %
40 (0:TEXT) 99 %
4 (0:TEXT) 99 %
30 (0:TEXT) 99 %
3 (0:TEXT) 99 %
25 (0:TEXT) 99 %
20 (0:TEXT) 99 %
2 (0:TEXT) 99 %
15 (0:TEXT) 99 %
12 (0:TEXT) 99 %
11 (0:TEXT) 99 %
10 (0:TEXT) 99 %
1 (0:TEXT) 99 %
09 (0:TEXT) 99 %
08 (0:TEXT) 99 %
04 (0:TEXT) 99 %
02 (0:TEXT) 99 %
01 (0:TEXT) 99 %
