PROVESID¶
PROVESID is a Python package for chemical identifier resolution, property retrieval, and local dataset lookup across multiple chemistry data systems.
What PROVESID Covers¶
PROVESID includes interfaces for online APIs and local/offline databases.
Online interfaces¶
PubChemAPIfor PUG-REST queries (compound, substance, assay, and identifier workflows)PubChemViewfor experimental property extraction with references and tabular outputsNCIChemicalIdentifierResolverfor identifier conversion through NCI resolver endpointsCASCommonChemfor CAS Common Chemistry lookupsChEBIfor ChEBI API accessOPSINandPYOPSINfor chemical name-to-structure conversionClassyFireAPIfor chemical taxonomy/classification
Offline and local dataset interfaces¶
CheMBLlocal SQLite interface (auto-download supported)PubChemIDlocal SQLite identifier database (CID/CAS/InChI/InChIKey and metadata)CompToxIDlocal SQLite interface (auto-download supported)ZeroPMlocal SQLite interface (auto-download supported)REACHDossierIDlocal REACH dossier dataset lookupChebiSDFlocal ChEBI SDF parser
Recommended Installation Method¶
Use uv as the primary installation workflow:
uv pip install provesid
For development from source:
git clone https://github.com/USEtox/PROVESID.git
cd PROVESID
uv pip install -e .
uv is recommended because PROVESID can work with large local data files and database assets. Using uv helps avoid repeated package/data copies across many environments.
Start Here¶
Tutorials¶
- PubChem Tutorial
- CAS Common Chemistry Tutorial
- ChEBI Tutorial
- ChEBI SDF Tutorial
- ClassyFire Tutorial
- OPSIN Tutorial
- Chemical ID Resolver Tutorial
- ChEMBL Tutorial
- PubChem View Tutorial
- ZeroPM Tutorial