Collecting and examining datasets on ethnicity and religion involves translating and codifying real-world phenomena, such as actions taken by governments and other groups, into data that can be analyzed with social science statistical techniques. This methodology is intended for phenomena whose original form is not readily accessible to statistical analysis, that is, “softer” phenomena and events such as government policies and conflict behavior. It is therefore unnecessary for phenomena such as GDP or government military spending, which are already recorded in numeric form; it applies instead to behavior by organizations or groups of individuals that a coder assesses and translates into data. Aggregate data collected by this methodology should have three qualities. First, the data must be reproducible. Second, the data must be transparent, in that all aspects of the data collection process and its products are clear and understandable to other researchers, to the extent that they could, in theory, be replicated. Third, the data must measure what they are intended to measure in a clear, accurate, and precise manner. A project that accomplishes all of this must be conceptualized properly from the beginning, including the decisions on which unit of analysis to use and which cases to include and exclude. It must have appropriate sources and a tight variable design. Finally, the data must be collected in a systematic, transparent, and reproducible manner.