Several people have asked me about using propensity score matching with complex surveys, and whether and how weights, primary sampling units (PSUs), and strata should be taken into account. This paper explains how to handle these issues.
Why use survey weights?
I just came across an old paper by Kish that does a nice job of describing some of the basic issues around survey weights:
WEIGHTING: WHY, WHEN, AND HOW?