HD 1991 - Hodgkin's disease - data structure
and protocols
[Return to the Contents
List]
Contents List
HD
1991 data preparation: brief written protocol
Strict confidentiality of trial results is
observed. Information is held in the Clinical Trial Service Unit computers
in a form which can be accessed only by known individuals. All patient
records are converted into 'pink form' (1991) format (described below)
if not already supplied in it. Results received as tables are converted
into sets of synthetic 'pink form' records. The following routine checks
(where appropriate) are performed on every 'pink form' compilation:
-
Duplicate patient entries
-
Patient identifier missing
-
Gender missing
-
Birth date missing
-
Laparotomy missing
-
Stage and/or systemic symptoms missing
-
Initial treatment date missing
-
Randomisation date missing
-
Treatment allocation missing
-
Primary treatment outcome missing
-
Outcome evaluation date missing
-
Recurrence date missing
-
Survival status missing
-
Death date missing
-
Birth date wrong
-
Initial treatment date wrong or out of range
-
either Initial treatment date after,
or more than 1y before randomisation date
-
or Initial treatment date before, or
more than 1y after randomisation date
-
or Time between initial treatment date
and randomisation date exceeds 30 days
-
Randomisation date wrong, before 1945 or out
of range
-
Outcome evaluation date wrong or out of range
-
Outcome evaluation date more than 1y after
initial treatment date
-
Recurrence date wrong or out of range
-
Last follow-up or death date wrong or out
of range
-
Gender code unknown
-
Laparotomy code unknown
-
Stage and/or systemic symptom code unknown
-
Treatment allocation code unknown
-
Primary treatment outcome code unknown
-
Survival status code unknown
-
Recurrence flag error
-
Cause of death given when alive
The total numbers of patients and the distributions
of randomisation age, disease stage, systemic symptoms and primary treatment
outcome are checked for any significant imbalance between treatment groups.
These four distributions are compared as follows. Patients are grouped
into three categories according to randomisation age (below 30 years; 30
- 49 years or unknown; 50 years or above) and a chi-squared test
is applied to the population of the three categories found in each treatment
group. Similarly, five categories are formed for disease stage (I; II;
III; IV; unknown), three for systemic symptoms (absent; present; unknown)
and three for primary treatment outcome (complete remission; other failures;
unknown) and these are tested in the same way as the categories formed
for randomisation ages.
If an event such as the recurrence of disease
is reported at a date later than the quoted last follow-up date, the last
follow-up date is automatically changed to the later date. The completeness
of follow-up is then calculated for the end of each calendar year. The
distributions of randomisation dates, randomisation ages and time elapsed
since last follow-up are checked for any significant imbalance between
treatment groups in two ways as follows. Firstly, a t-test is applied
to the difference between the mean value of each distribution for patients
in each group with the corresponding mean for patients in the remainder.
Secondly, an F-ratio is calculated for each distribution by comparing
the variance between the groups with the variance within the groups. The
distribution of time elapsed since last follow-up is also checked in these
two ways for any significant imbalance between those patients with and
those patients without a recorded recurrence of disease. Finally, the distribution
of time elapsed since randomisation is checked in the same two ways for
any significant imbalance between patients in four categories of disease
stage (I; II; III; IV), two categories of systemic symptoms (absent; present)
and two categories of primary treatment outcome (complete remission; other
failures).
Where patient serial numbers form an obvious
sequence it is checked for missing numbers.
A tabulated breakdown of variables is produced
for each trial, together (where relevant) with lists of patients in 'problematical'
categories such as those with lapsed follow-up, uncertain death cause or
second malignancy site. Graphs of accrual date and the proportion of living
patients still on follow-up as a function of time from randomisation by
treatment allocation are also produced, together with Kaplan-Meier life-table
curves. Before trial data are finally incorporated into the overview, the
analyses described above are sent to the participating trialist(s) for
checking and approval.
Contact
Please address inquiries concerning data
preparation and checking to:
Vaughan Evans
IHDCG Secretariat
C.T.S.U.
Radcliffe Infirmary
Oxford OX2 6HE
England
Tel. U.K.(44)-Oxford(1865)-557241; FAX: U.K.(44)-Oxford(1865)-558817
Specification
of HD 1991 'pink form' format
Item |
Description |
FORTRAN |
Columns |
Details |
Abbreviation |
0 |
Trial/stratum identifying code |
I6 |
1 - 6 |
|
Trial |
1 |
Patient identifier (or sequence number) |
A12 |
8 - 19 |
|
Patient |
2 |
Gender |
I1 |
21 |
Value |
Description |
Abbreviation |
1 |
Male |
M |
2 |
Female |
F |
|
Sex |
3 |
Date of birth |
I6 |
23 - 28 |
DDMMYY |
Birth Date |
4 |
Laparotomy |
I1 |
30 |
Value |
Description |
Abbreviation |
1 |
No |
No |
2 |
Yes |
Yes |
|
Lap. |
5 |
Disease stage |
I1 |
32 |
Value |
Description |
Abbreviation |
1 |
I |
I |
2 |
II |
II |
3 |
III |
III |
4 |
IV |
IV |
|
Stage |
6 |
Systemic symptoms |
I1 |
34 |
Value |
Description |
Abbreviation |
1 |
Absent |
No |
2 |
Present |
Yes |
|
Symp. |
7 |
Date of initial treatment/diagnosis |
I6 |
36 - 41 |
DDMMYY |
1-Trt. Date |
8 |
Randomisation date |
I6 |
43 - 48 |
DDMMYY |
Entry Date |
9 |
Treatment group allocated (as on master
list) |
I1 |
50 |
|
Trt. Grp |
10 |
Primary treatment outcome |
I1 |
52 |
Value |
Description |
Abbreviation |
1 |
Complete remission |
CR |
2 |
Other failures |
Ofail |
|
Prim. Outc |
11 |
Outcome evaluation date |
I6 |
54 - 59 |
DDMMYY |
Outcome Date |
12 |
Recurrence |
I1 |
61 |
Value |
Description |
Abbreviation |
1 |
No |
No |
2 |
Yes |
Yes |
|
Rec |
13 |
Date of first recurrence |
I6 |
63 - 68 |
DDMMYY |
Rec. Date |
14 |
State when last traced |
I1 |
70 |
Value |
Description |
Abbreviation |
1 |
Alive |
Alive |
2 |
Dead |
Dead |
3 |
Lost |
Lost |
|
State |
15 |
Date died or last traced |
I6 |
72 - 77 |
DDMMYY |
L.F.U. |
16 |
Cause of death (extra category); there
is also a 'pre-diktat' Cause of Death item, with the same coding, in form
columns 79-80 |
I2 |
82 - 83 |
Value |
Description |
Abbreviation |
1 |
Iatrogenic |
Iatro |
2 |
Infective |
Infec |
3 |
Leukaemia |
Leuk |
4 |
Solid neoplasm |
Neop2 |
5 |
Cardiovascular |
CardV |
6 |
- |
|
7 |
- |
|
8 |
Extraneous cause |
Extra |
9 |
Not 1-5,8,13,14 and not Hodgkin's dis. |
NotHD |
10 |
Not Hodgkin's disease |
NotHD |
11 |
Hodgkin's disease |
HD |
12 |
Unascertainable cause |
Unkn. |
13 |
Pulmonary |
Pulmo |
14 |
Non-Hodgkin's lymphoma |
NonHL |
|
Death |
17 |
Name (if given), cause of death and comments |
A |
85 - end |
|
Name |
Missing or unknown items are left blank or
set to zero.
HD
1991 data form rubric
GUARANTEE OF CONFIDENTIALITY OF DATA
ANY INFORMATION PROVIDED OVERLEAF TO THE
COLLABORATIVE GROUP SECRETARIAT WILL BE HELD SECURELY AND IN STRICT CONFIDENCE.
NOTES ON FORMAT OF DATA REQUESTED OVERLEAF
Special coding conventions
Please accompany these forms by an explanatory
letter about any special coding conventions (e.g. on histology or
stage) that you have used, plus notes on any special features of the study(s)
to which you wish to draw particular attention.
Dates that are not (or not yet) known exactly
either leave DAY blank, and give
(approximate or provisional) month and year;
or leave DAY and MONTH blank, and
just give approximate year.
BASELINE DATA
Patient identifier
Use any convenient convention you wish
that will enable the patient to be identified if any questions arise. (If
reporting several trials, please try to use an identifier that implicitly
specifies both the trial and the patient.)
Gender
1 = male; 2 = female (or M = male; F =
female.)
Date of birth (or age)
Give age if date of birth not readily
available: see note above on approximate dates.
Laparotomy
0 or blank = not (yet) currently available;
1 = no; 2 = yes.
Stage
0 or blank = not (yet) currently available;
1 = I; 2 = II; 3 = III; 4 = IV. If you prefer to use your own classification
of stage please do so and explain details of it.
Systemic symptoms
0 or blank = not (yet) conveniently available;
1 = absent; 2 = present (or A = absent; P = present.)
Date of initial treatment (or of diagnosis)
See note above on approximate dates.
Date randomised
Please describe ALL patients EVER randomised,
including
even lost or withdrawn patients, and ignore all non-randomised patients.
Treatment group allocated
Treatment group number: 1 or 2 only, for
2-group trials, 3, 4 etc for trials with more arms, as defined by you at
the top of the form. N.B: even if, in reality, some quite different
(or even opposite!) treatment was inadvertently given, what is wanted is
the originally-allocated treatment. (If any patients were erroneously
entered more than once, give only the first allocation.)
FOLLOW-UP DATA
Primary treatment outcome
0 or blank = not (yet) known; 1 = complete
remission; 2 = failure to achieve complete remission (e.g. PD, NC,
PR, death without remission).
Outcome evaluation date
See note above on approximate dates.
Recurrence?
1 = none recorded; 2 = some recurrence.
Approx. date of recurrence
Give the best estimate you can: see note
above on approximate dates.
Dead/other
1 = alive when last traced; 2 = known
to be dead.
Date died/last traced
Date of death, or date last known to be
alive, as accurately as possible: see note above on approximate dates.
Death cause
If the patient has died, give underlying
cause of death. Either state the cause in words, use an ICD code or use
your own classification and explain details of it.
[Return to the Contents
List]
[End of document, updated to 1 September
2000]