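The assumption implies that

$$
\begin{aligned}
E[Y_{1i}\mid D_i=1,X_i] &= E[Y_{1i}\mid D_i=0,X_i] = E[Y_{1i}\mid X_i],\\
E[Y_{0i}\mid D_i=1,X_i] &= E[Y_{0i}\mid D_i=0,X_i] = E[Y_{0i}\mid X_i].
\end{aligned}
$$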
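The ATT for $X_i=x$ is given by

$$
\begin{aligned}
E[Y_{1i}-Y_{0i}\mid D_i=1,X_i]
&= E[Y_{1i}\mid D_i=1,X_i]-E[Y_{0i}\mid D_i=1,X_i]\\
&= E[Y_i\mid D_i=1,X_i]-E[Y_{0i}\mid D_i=0,X_i]\\
&= \underbrace{E[Y_i\mid D_i=1,X_i]}_{\text{avg with } X_i \text{ in treatment}}-\underbrace{E[Y_i\mid D_i=0,X_i]}_{\text{avg with } X_i \text{ in control}}
\end{aligned}
$$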
The components in the last line are identified (can be estimated).
Intuition: we compare outcomes across the treatment and control groups after conditioning on $X_i$, that is, among units with the same value of $X_i$.
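The ATT is given by

$$
\begin{aligned}
ATT &= E[Y_{1i}-Y_{0i}\mid D_i=1]\\
&= \int E[Y_{1i}-Y_{0i}\mid D_i=1,X_i=x]\,f_{X_i}(x\mid D_i=1)\,dx\\
&= E[Y_i\mid D_i=1]-\int E[Y_i\mid D_i=0,X_i=x]\,f_{X_i}(x\mid D_i=1)\,dx
\end{aligned}
$$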
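The last equality in the ATT expression skips a step. A sketch of it, using only the conditional result for $X_i=x$ derived earlier and the law of iterated expectations (the treated-group density integrates the first term back to the unconditional treated mean):

```latex
\begin{aligned}
\int E[Y_{1i}-Y_{0i}\mid D_i=1,X_i=x]\,f_{X_i}(x\mid D_i=1)\,dx
&= \int E[Y_i\mid D_i=1,X_i=x]\,f_{X_i}(x\mid D_i=1)\,dx
 - \int E[Y_i\mid D_i=0,X_i=x]\,f_{X_i}(x\mid D_i=1)\,dx\\
&= E[Y_i\mid D_i=1]
 - \int E[Y_i\mid D_i=0,X_i=x]\,f_{X_i}(x\mid D_i=1)\,dx
\end{aligned}
```

Note that the control-group means are weighted by the distribution of $X_i$ among the treated, not among the controls.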
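The ATE is

$$
\begin{aligned}
ATE &= E[Y_{1i}-Y_{0i}]\\
&= \int E[Y_{1i}-Y_{0i}\mid X_i=x]\,f_{X_i}(x)\,dx\\
&= \int E[Y_i\mid D_i=1,X_i=x]\,f_{X_i}(x)\,dx-\int E[Y_i\mid D_i=0,X_i=x]\,f_{X_i}(x)\,dx
\end{aligned}
$$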
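For a discrete covariate, the integrals above become weighted sums over the values of $X_i$. The following is a minimal sketch of that plug-in calculation, not a definitive estimator. The function name `stratified_effects`, the toy data, and the requirement that every cell contains both treated and control units (overlap) are assumptions made for illustration.

```python
import numpy as np


def stratified_effects(y, d, x):
    """Plug-in ATT and ATE for a discrete covariate x (illustrative sketch).

    Assumes every value of x has at least one treated and one control unit.
    """
    y = np.asarray(y, dtype=float)
    d = np.asarray(d, dtype=int)
    x = np.asarray(x)
    treated = d == 1

    cond_diff = {}  # E[Y|D=1,X=x] - E[Y|D=0,X=x] for each value x
    att = 0.0
    ate = 0.0
    for value in np.unique(x):
        in_cell = x == value
        cell_treated = in_cell & treated
        cell_control = in_cell & ~treated
        if not cell_treated.any() or not cell_control.any():
            raise ValueError(f"no overlap in cell x={value}")

        diff = y[cell_treated].mean() - y[cell_control].mean()
        cond_diff[value.item()] = float(diff)

        # ATE weights each cell by P(X=x) in the full sample.
        ate += diff * in_cell.mean()
        # ATT weights each cell by P(X=x | D=1) among the treated.
        att += diff * cell_treated.sum() / treated.sum()

    return cond_diff, att, ate


# Hypothetical toy data: two covariate cells, ten units.
x = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1])
d = np.array([1, 1, 0, 0, 0, 1, 1, 1, 0, 0])
y = np.array([3, 5, 1, 2, 3, 10, 12, 14, 6, 8], dtype=float)

cond_diff, att, ate = stratified_effects(y, d, x)
naive = y[d == 1].mean() - y[d == 0].mean()
print(cond_diff)
print(f"ATT = {att:.2f}, ATE = {ate:.2f}, naive = {naive:.2f}")
```

In this toy sample the within-cell differences are 2 and 5. The ATT and ATE differ only because treated units are more concentrated in the second cell, and the unconditional difference in means differs from both because it ignores $X_i$ altogether.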