Limitations
 
Adult Cases Only
This website currently only catalogs cases that were handled in adult court, not juvenile court. The website does include cases against juveniles that were handled and completed in adult court.
Human Error
Nearly all of the data used to construct this website are subject to human error. Police officers enter reports on arrests and court clerks enter data on cases. Mistakes made during data entry can appear in our data. Although these errors exist on individual cases, we have no reason to think that errors are common, and their impact is minimized when large amounts of data are available.
We can only measure and report on things that we have data on
Our data systems are far from perfect, and we do not always have analyzable data on everything that we would like to be able to report on. Because of that, there are reports that would be very useful for transparency and accountability that we cannot make. As an example, as of yet, we do not electronically record the amount of bail our office requests at bail hearings, so we cannot report on how our bail requests match or differ from bail assigned by a bail magistrate. We are working to improve our systems so that we can better record and report on everything that happens in the criminal justice system.
Year to Date Reports Are Subject to Large Fluctuations Early in the Year
Unless otherwise noted, our reports are ‘year to date’ reports. These allow a person to compare what has happened from January 1 until today in each of the years reported on. During the first month or two of the year, when fewer cases have been processed, it is harder to compare case numbers from year to year. This is because when case numbers are low, a small group of anomalous cases can skew the results. Take with a grain of salt year over year comparisons during this time period. If you are interested in longer term reports, we have published a version of each report in a ‘year end’ version that shows full year results for all years but the current year. These are accessible through the menu at the top of the page.
Offense Categorizations Happen Early in the Process

Categorization of events in the criminal justice system happen very early in the process, often before the District Attorney’s Office charges a case. Initial classifications can change over time as the case evolves and more information becomes available. The data in this dashboard adhere to the original classification of a case rather that the classification of a case as it moves through the system. This has both strengths and limitations. The advantage in using this system is that if an incident is classified as, for example, drug sales, the arrest, charging, case outcomes, and future years of incarceration reports will all report on that case as a drug sale, even if the case is ultimately only charged as possession. By maintaining the original classification, a case can be “tracked” through the system, and in turn it becomes possible to know the outcomes for all cases that began in a particular category.

  • First, it is possible that the District Attorney’s Office charged the case differently than it was originally classified. As an example, an assault may initially be classified as “aggravated” because of the presence of a weapon. The District Attorney’s Office may not have found enough evidence that the weapon was used and may have only charged the case as a simple assault. The classification of aggravated assault, then, does not accurately reflect the incident as it moves through the courts, although it accurately reflects the assault at the time of the incident and arrest.

  • Second, the classification system that is used for offenses only allows for a single classification for each incident. This can lead to distortions when a person is ultimately charged with multiple cases from a single arrest. For example, if an individual shoots (non-fatally) three people in the course of a robbery, under this system all four offenses (the robbery and the three shootings) may be classified as “Robbery/Gun” even though three of the four cases that come from the incident would be for the shooting itself and not the robbery. This is because robbery with a gun is classified as “more serious” than an aggravated assault with a gun.

Offenses May Be Recategorized Over Time
Over time, offenses may be recategorized if underlying facts change. For example, if a person is shot but not killed, the offense may be categorized as an aggravated assault with a gun. If the victim later dies, the offense may be recategorized as homicide. When offenses are recategorized in this way, the data will update to reflect the new categorization. As a user of the site, you may notice historic numbers changing over time.
Violent and Property Offenses Only Include UCR Part I Offenses
Offenses classified on this website as “violent” or “property” only include violent and property offenses classified as “Part I” by the FBI Uniform Crime Reporting system. As a practical matter, that means that simple assaults, basic weapon possession offenses, and sexual offenses other than rape are not specifically enumerated in our reports.
Non-Fatal Shooting Information Only for Shootings Since 2015
Information about Non-Fatal Shootings is based on the Open Data Philly Shooting Victims dataset which only includes shooting victims since 2015. That means that all information about shooting victims (arrests, cases charged, outcomes, etc…) only reflects shooting incidents since 1/1/2015. As an example, the Case Outcomes report shows very few cases resolved in 2015 and a very high rate of non-convictions. The reason for this is that very few shooting incidents that occured in 2015 were also resolved in 2015; those that were resolved that quickly were often quickly dismissed.
Not All of the Reports Can Be Recreated Using the Available CSV Files
We have made daily-summary CSV files available to the public so that the public can discover new insights into the criminal justice system and our work. Because these files do not include case-level information, but rather are summaries of all of the cases/arrests on a day, it is not always possible to recreate the information contained in the reports using the CSV files.
Where are Race and Gender?
There are numerous features that we would like to make available in the future, such as the ability to view data based on demographics of the accused and victims, like race and gender. We aim to continually improve this website to add more data, features, and interactivity in the future.

In addition, the District Attorney’s Office (DAO) is presently working with criminal justice partners in Philadelphia to understand the limitations of current race and gender data, and to analyze what is available.
Reports Generated Automatically
The reports on this website are generated automatically. Sometimes errors will appear before we find them. Please let us know about any errors so that we can fix them quickly.
Arrests Prior to 2012
The data used to create this dashboard includes all arrests that took place since 1/1/2012 and all cases that had hearings since 1/1/2012. As a general matter, this should not affect any of our statistics. Cases that began with an arrest prior to 2012 will be classified as “other offenses” since the arrest classification was not available to our data.
Exonerations not Categorized as Such
Cases that are overturned after appeal are often overturned years after the arrest. Because we can only classify arrests from 2012 forward, it is impossible to classify the offense type of most overturned cases. Rather than misclassify these cases as “Other” we decided to not include this categorization. These cases still appear in our data, generally categorized as “Withdrawn” cases.
Arresting Agency Generally the Philadelphia Police Department
Generally the Philadelphia Police is the arresting agency, but our data includes other police agencies such as Southeastern Pennsylvania Transportation Authority (SEPTA) police.
Declination Rate Cannot Be Calculated
The DAO’s declination rate (the rate at which we decline to charge arrests) cannot be calculated from the data on this website. This is because declination is more complicated than comparing arrests to cases charged. For example, it is common for the DAO to open multiple cases based on a single arrest. The DAO can also decline and ask the police to gather more evidence before charging an individual case. If the person is later rearrested and charged, the prior declination will be lost.
Life in Prison and Death Sentences
When calculating future years of incarceration, we assume that person a sentenced to a life sentence or a death sentence will live to their normal life expectancy using the 2015 actuarial table from the Social Security Administration. For example, a 25-year-old male sentenced to life in prison would be calculated to have a sentence of 52.3 years.
Diversion Outcomes Only Reflect People Who Have Completed Diversion
We made a decision to only include people who have completed their diversion program in our data, rather than people who started but never completed. This means that the District Attorney’s Office diverts an even larger number of cases each year, but individuals may not complete their assigned program.
Incident Reporting is Preliminary and is not Authoritative

The District Attorney’s Office does not have data on crime incidents in the City other than what is available to the public through the City of Philadelphia’s open data program. We use OpenDataPhilly’s Parts I & II Crime Incidents dataset to study crime events, many of which do not lead to arrests. This leads to a number of limitations:

  • This database only records crimes classified as Part I or Part II crimes by the FBI, and does not include all violations. This means that not all crimes that happen within the City are shown in our Incident report.

  • The incidents recorded in this dataset are given a generalized of the FBI Uniform Crime Reporting (UCR) code. We cannot determine the specific nature offenses from the data available. For example, all Thefts are recorded as “Theft” in the Incident Report, while we can separate “Theft” from “Retail Theft” in all other reports. Another example of this is that all drug violations are recorded as simply Drug Offenses, and we cannot separate them into possession and sales as we do in all other reports.

  • This dataset is intended to allow members of the public, community groups, researchers, and other departments to access up-to-date and actionable data on crime incidents in Philadelphia. It is not intended to be an authoritative crime reporting database. The statistics generated from this database may not reflect the official reporting by the Philadelphia Police Department (PPD).

  • As a result of the point above, it is possible to derive three different numbers for Homicides from Police sources. We have summarized the OpenDataPhilly Incidents and derived the total number of incidents that have been classified in that data as Homicides. This is often different than the number reported on the front page of the Police Crime Maps & Stats page. Another number for homicides can be found in the Police’s weekly Crime Stats Reports; this number is also often different than those found in the Incident Data and Crime Maps & Stats page.

    We have decided to include the Incident Report to help those who view the dashboard get as complete a view of the criminal justice system as possible, starting from the incident to the future years of supervision or incarceration generated.

  • Released on Cash Bail
    We currently only report the amount of bail initial assigned to a case, not whether it was paid or later modified.
    SOB and Cash Bail
    On a modest number of older cases, we know the amount of bail that was set, but we don’t know whether that bail was set as Cash bail (requiring the person to pay in order to be released) or SOB (allowing the person to be released without paying any money). In these situations we assume that the person is required to pay Cash bail.
    Geographic Information is Limited
    Due to the quality of the data available, our ability to locate incidents and associated arrest and case information throughout the City is currently limited to what we can connect from public data sources. This affects what data we can deliver for download. Data by Police District is the most complete, but data independent from Police Districts, such as Census Tracts, may be incomplete because we’re missing precise location information that allows us to map an incident to a geography. We estimate that approximately 8% of all arrest and case information is classified as Unknown. Census Tract information is limited to weekly due to file size limitations.
    Downloadable ZIP Code data is by ZIP Code Tabulation Area (ZCTA)

    The data by ZIP code available for download is actually by ZIP Code Tabulation Area, a geography defined by the United States Census Bureau using the majority ZIP Code for addresses in Census blocks. In densely populated cities like Philadelphia, ZCTAs and postal ZIP codes are functionally identical. For more information, visit the U.S. Census Bureau’s page on ZIP Code Tabulation Areas.

    We use ZCTAs in the downloadable sets for two reasons:

    • ZCTAs allow for easy and clean connection to U.S. Census data.
    • The location data of the incident tends to be more reliable than the ZIP Code in the Court information. Currently, Court data contains a large number of missing, incomplete, wrong, or otherwise bad ZIP Codes. This is likely due to the data being human-entered in the field, where one might not know what ZIP code they are currently in.