In 2017 the Mexican fiscal authority Sistema de Administración Tributaria (SAT) published an extensive catalog of products and services (with over fifty two thousand entries). By law, digital tax receipts ("CFDI's") need to include the codes contained in the catalog.
Due to its size and ad hoc nature, incorporating the catalog into an existing point of sale software is a challenge. I used Wolfram Language data and text analysis capabilities to generate a database suitable for the pharmaceutics industry.
SAT_database_explorer.nballows to explore the pharmaceutics part of the SAT Catalog using a dataset in Wolfram Language format.pharmaceutics_SAT.cfdsummarizes classes and subclasses of pharmaceutics according to the SAT Catalog.SAT_data_analysis.nblogs the steps used to convert, analyze and generate a dataset in Wolfram Language format.
Directory data contains the relevant data in various formats. The original catalog is SAT_catalog.xls. The files with the .wdx extension store the datasets in Wolfram Language format.