IRT Modeling Lab

Estimating Parameters (1PL, 2PL, and 3PL)

A variety of programs are available for estimating IRT parameters, particularly for dichotomous unidimensional models. Two popular programs are PARSCALE (Muraki, 1990) and BILOG (Mislevy & Bock, 1991). Both are available from Scientific Software. In the examples that follow, the DOS version of BILOG input and output is discussed.

It is often necessary to estimate parameters for a large number of scales or experimental groups. For that purpose, we prefer the DOS version of BILOG because it is very easy to modify the input files and create MS-DOS batch files to conduct multiple runs. The DOS version of BILOG is actually a batch file (*.bat) that launches three executables.

The simplest way to estimate parameters for your data using BILOG is to copy your dataset into the BILOG directory. Then the BILOG input file extension must be modified and saved with the .blg extension.

Data File
Your data file must be saved in ASCII format (using Notepad for example). Note that BILOG requires an ID number for each respondent. To see how to prepare data for BILOG please click here.

Input File
Next an input file for BILOG must be created. Rather than starting from scratch, we suggest modifying one of the example input files that we provide.

AGREEABLENESS CALIBRATION FOR IRT TUTORIAL.
>COMMENT
>GLOBAL DFN='AGR2_CAL.DAT', NIDW=4, NPARM=3, OFNAME='OMIT.KEY', SAVE;
>SAVE SCO = 'AGR2_CAL.SCO', PARM = 'AGR2_CAL.PAR', COV = 'AGR2_CAL.COV';
>LENGTH NITEMS=(10);
>INPUT  SAMPLE=99999;
(4A1,10A1)
>TEST TNAME=AGR;
>CALIB  NQPT=40, CYC=100, NEW=30, CRIT=.001, PLOT=0;
>SCORE  MET=2, IDIST=0, RSC=0, NOPRINT;

If you are using the MS-DOS editor to create or edit this file, it should appear as follows:


Example data and appropriate BILOG files can be downloaded below.
agr2_cal.dat
con2_cal.dat
agr2_cal.blg
con2_cal.blg


Running BILOG
To run BILOG from a DOS prompt go to BILOG directory. Type 'Bilog' and the name of your the input file. For example, type "BILOG AGR2_cal". Note that the .blg extention is not necessary.

There are situations where you may want to run several BILOG runs consecutively. This can be accomplished easily using a MS-DOS batch file that calls BILOG several times. Note that it is necessary to use a separate input file for each data set. Here is a batch file (run.bat) that calls BILOG to do two runs using the example data. To execute the batch file, save it in the BILOG directory and type "run.bat'.

Output Files
BILOG output files for the example data can be downloaded below. We suggest that you download and print the files. A summary of the information in each file follows.
agr2_cal.ph1
agr2_cal.ph2
agr2_cal.ph3
agr2_cal.par
agr2_cal.sco
agr2_cal.cov

*.PH1
After running BILOG, first examine .ph1 file. This file contains a summary of your input file, the first two response vectors from your data file and a table of classical test theory (CTT) statistics. You should first check that your response data was read correctly and that the CTT statistics match those obtained during the data preparation phase.

In particular, look for items that exhibit extreme p-values (p>.95 or p<.05) or near zero (and of course negative) item-total correlations. These items may cause problems for the convergence of the parameter estimation procedure. Information regarding the IRT parameter estimation procedure is provided in the *.ph2 file.


If you are using the MS-DOS editor to view the agr1_cal.ph1 file, it should appear as follows:


As described above, the last portion of the file provides classical test statistics:

*.PH2
The *.ph2 file contains a summary of the input file specifications concerning parameter estimation, the quadrature points used for parameter estimation (we recommend 40), likelihood values across estimation cycles, and a table of item parameters and standard errors.

It is important to check that the estimation procedure converged; i.e. the largest change in any parameter for the last iteration was below the selected convergence criterion. In this case, a criterion of .001 was used. Lack of convergence is indicated by large changes in the likelihood function up to the maximum number of cycles.

If the estimation procedure did not converge, a number of strategies can be applied. First, one should try increasing the number of cycles. Second, it may be necessary either to specify an alternative prior distribution for one or more parameters (Bays prior distribution). Third, it is often necessary to delete items with low item-total correlations (e.g., < 0.1) as provided in the *.ph1 file's CTT statistics. Note BILOG will not converge if an item with a negative item-total correlation is present.


If you are using the MS-DOS editor to view the agr1_cal.ph2 file, we see that only 14 cycles were necessary to adequately estimate the parameters:


*.PH3
The *.ph3 file contains information concerning theta estimation.


The following output files are created only if explicitly requested using the 'PARM', 'COV' and 'SCO' commands in the BILOG input file.

*.PAR
The *.par file contains item parameters and their standard errors. For each item there are two lines. The first line contains the parameter estimates in the fourth, fifth, and seventh columns. The second line contains the standard errors for these estimates directly under the actual estimates.

In order to rearrange the parameters in a more easy to understand file, we have a program, PARTO3PL, that will read in all *.par files in a directory, delete the first four lines (title information) and the lines that contain the standard errors, leaving only the parameters available for later use in a *.3pl file. Such a file is useful when assessing model-data fit and with examining the presence of differential item functioning.

These parameters can be plotted with the MODFIT program (explained later in this tutorial). Below are the plots for the Agreeableness scale.

If you are using the MS-DOS editor to view the agr1_cal.par file, it should appear as follows:


By using these values with the equation for the 3PL model and varying levels of theta in a spreadsheet, plots can be generated. Later in this tutorial, you will be introduced to the MODFIT program, which can generate plots automatically after being provided with parameters.

Below are the plots for the Agreeableness scale that can be made from the parameters obtained in BILOG.

Graph
Plots of Agreeableness items
Previous   Next


Below are the plots for the Conscientiousness scale that can be made from the parameters obtained in BILOG.

Graph
Plots of Conscientiousness items
Previous   Next


*.SCO
The *.sco file is created in during Phase 3 and provides ability score information for each respondent. (Note that in the screen capture below, the last two columns are not discussed as they are beyond the scope of this tutorial, please consult your BILOG manual for these special applications, i.e., group fit probability and EAP scoring.)

View of the agr1_cal.sco file in the MS DOS editor:


*.COV
The *.cov file is similar to the *.par file in appearance, but provides parameters as well as the variances/covariances between the parameters. The covariances for the item parameter estimates are necessary for differential item functioning analyses (DIF, explained later in this tutorial).

View of the agr1_cal.cov file in the MS DOS editor:

Note: some of the arrows in the following picture are pointing at the wrong direction (e.g., arrows 5 to 9). Here is the correct format for covariance files:
Lines 1 & 2 contains the title records of the BILOG run that created the covariance file. Afterwards, there are two lines for each item. The first record contains
(1-4)
item name,
(5-12) subtest name,
arrow 1: (13-24) slope estimate (a),
arrow 2: (25-36) threshold estimate (b),
arrow 3: (37-48) lower asymptote estimate (c),
arrow 4: (49-60) estimation of error variance for slope (SEa2), and
arrow 5: (61-72) estimation of error covariance for slope and threshold (Cova,b).
The second record contains
(1-12) blank filler,
arrow 6: (13-24) estimation of error variance for threshold (SEb2),
arrow 7: (25-36) estimation of error covariance for slope and asymptote (Cova,c),
arrow 8: (37-48) estimation of the error covariance for threshold and asymptote ((Covb,c), and
arrow 9: (49-60) the estimate of the error variance for lower asymptote (SEEc2).
Each column is formatted as (f12.6).

 
Back