Hi Martine,
Apologies for the long delay in response. There are very minor differences
between the cutpoints chosen by the R and Stata versions of CEM. R will
attempt to find "pretty" cutpoints at round numbers for instance, whereas
Stata will simply divide the range of the variable in equally sized bins.
There's no way to have "imb" in Stata show you the exact cutpoint used,
but
if you run cem with the options "auto(scott)" and "showbreaks"
you'll see
the breaks chosen by the default algorithm in "imb." Hope that helps!
Cheers,
Matt
~~~~~~~~~~~
Matthew Blackwell
Associate Professor of Government
Harvard University
url:
On Tue, Feb 18, 2020 at 3:51 AM Blom, drs. M. - WODC/CRS <m.blom(a)wodc.nl>
wrote:
Hi All,
I can't get why the analyzes below, performed in Stata / R, do not produce the same
result.
Please provide your suggestions for the explanation for these differences.
Thanks,
Martine
PS (how) can I display the actual cutpoints in Stata (imb)?
-- STATA
. import delimited "G:\Recidive\Projecten\Verkeer\Algemeen\Artikel
effectiviteit ASP tijdens\Data\Data voor R.csv", clear
(8 vars, 8,627 obs)
. imb d_sekse lftbegrec d_addinfo_ind_best lftinsz1 vgalgexuz vgverkexuz
vgrijoiuz, treatment (treat)
(using the scott break method for L1 distance)
Multivariate L1 distance: .68243298
Univariate imbalance:
L1 mean min 25% 50%
75% max
d_sekse .01557 -.01557 0 0 0
0 0
lftbegrec .0687 -1.5626 0 -1 -2
-1 -5
d_addinfo_ind_best .00497 -.00497 0 0 0
0 0
lftinsz1 .08852 .91045 0 1 1
2 -12
vgalgexuz .14912 -1.2611 0 0 -1
-1 78
vgverkexuz .09852 -.41727 0 0 0
-1 4
vgrijoiuz .11859 -.36406 0 0 0
0 -4
--R
data <- read.csv(file =
"G:\\Recidive\\Projecten\\Verkeer\\Algemeen\\Artikel effectiviteit ASP
tijdens\\Data\\Data voor R.csv")
cov <- c("D_SEKSE",
"LFTBEGREC", "D_ADDINFO_IND_BEST","LFTINSZ1",
"VGALGEXUZ", "VGVERKEXUZ", "VGRIJOIUZ")
imb <- imbalance(group = data$TREAT, data =
data[cov])
imb
Multivariate Imbalance Measure: L1=0.665
Percentage of local common support: LCS=11.2%
Univariate Imbalance Measures:
statistic type L1 min 25% 50% 75% max
D_SEKSE -0.015571508 (diff) 0.015571508 0 0 0 0 0
LFTBEGREC -1.562561051 (diff) 0.025019213 0 -1 -2 -1 -5
D_ADDINFO_IND_BEST -0.004970458 (diff) 0.004970458 0 0 0 0 0
LFTINSZ1 0.910454843 (diff) 0.045867098 0 1 1 2 -12
VGALGEXUZ -1.261056065 (diff) 0.146272471 0 0 -1 -1 78
VGVERKEXUZ -0.417271627 (diff) 0.097957912 0 0 0 -1 4
VGRIJOIUZ -0.364063976 (diff) 0.118028583 0 0 0 0 -4
------------------------------
Dit bericht kan informatie bevatten die niet voor u is bestemd. Indien u
niet de geadresseerde bent of dit bericht abusievelijk aan u is
toegezonden, wordt u verzocht dat aan de afzender te melden en het bericht
te verwijderen. De Staat aanvaardt geen aansprakelijkheid voor schade, van
welke aard ook, die verband houdt met risico's verbonden aan het
elektronisch verzenden van berichten.
Ministerie van Justitie en Veiligheid
This message may contain information that is not intended for you. If you
are not the addressee or if this message was sent to you by mistake, you
are requested to inform the sender and delete the message. The State
accepts no liability for damage of any kind resulting from the risks
inherent in the electronic transmission of messages.
Ministry of Justice and Security
-
--
cem Mailing List, served by HUIT
Send messages: cem(a)lists.gking.harvard.edu
[un]subscribe Options:
http://lists.gking.harvard.edu/?info=cem
More information on cem:
http://gking.harvard.edu/cem
Cem mailing list
Cem(a)lists.gking.harvard.edu
To unsubscribe from this list or get other information:
https://lists.gking.harvard.edu/mailman/listinfo/cem