Molecular Data

The molecular_data module provides functions for the Molecular Data section of the cBioPortal Web Public API.

pybioportal.molecular_data.fetch_all_molecular_data_in_molecular_profile(molecular_profile_id, entrez_gene_ids=None, sample_ids=None, sample_list_id=None, projection='SUMMARY')

Fetch molecular data in a molecular profile for a list of genes.

Parameters:
  • molecular_profile_id (str) – Molecular Profile ID (e.g., "acc_tcga_rna_seq_v2_mrna").

  • entrez_gene_ids (list of str) – List of Entrez Gene IDs (e.g., ["672", "675"]).

  • sample_ids (list of str) – List of Sample IDs (e.g., ["TCGA-AR-A1AR-01", "TCGA-BH-A1EO-01"]).

  • sample_list_id (str) – Sample List ID (e.g., "brca_tcga_all").

  • projection (str) –

    Level of detail of the response.

    Possible values:

    • "DETAILED": Detailed information.

    • "ID": Information with only IDs.

    • "META": Metadata information.

    • "SUMMARY": Summary information (default).

Returns:

A DataFrame containing molecular data for the specified genes.

Return type:

pandas.DataFrame
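The projection argument accepts the four values listed above. As a minimal sketch, a small guard can catch a mistyped value before any request is made. The helper check_projection and the constant VALID_PROJECTIONS are hypothetical names introduced here for illustration; they are not part of pybioportal, and whether the library performs a similar check itself is not stated on this page.

```python
# Hypothetical helper, not part of pybioportal: validate a projection value
# against the four options listed in the parameter description.
VALID_PROJECTIONS = ("DETAILED", "ID", "META", "SUMMARY")


def check_projection(projection="SUMMARY"):
    """Return the projection unchanged, or raise ValueError if it is not listed."""
    if projection not in VALID_PROJECTIONS:
        raise ValueError(
            f"projection must be one of {VALID_PROJECTIONS}, got {projection!r}"
        )
    return projection


print(check_projection("ID"))
try:
    check_projection("summary")  # treated as invalid: this sketch is case-sensitive
except ValueError as err:
    print("rejected:", err)
```

The returned value could then be passed as projection= to any of the functions on this page.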

pybioportal.molecular_data.fetch_molecular_data(entrez_gene_ids=None, molecular_profile_ids=None, sample_molecular_identifiers=None, projection='SUMMARY')

Fetch molecular data.

Parameters:
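  • entrez_gene_ids (list of str) – List of Entrez Gene IDs (e.g., ["672", "675"]).

  • molecular_profile_ids (list of str) – List of Molecular Profile IDs (e.g., ["brca_tcga_mrna", "acc_tcga_rna_seq_v2_mrna"]).

  • sample_molecular_identifiers (list of dict) – List of dicts, each pairing a Molecular Profile ID with a list of Sample IDs (e.g., [{"molecular_profile_id": "brca_tcga_mrna", "sample_ids": ["TCGA-AR-A1AR-01", "TCGA-BH-A1EO-01"]}]).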

  • projection (str) –

    Level of detail of the response.

    Possible values:

    • "DETAILED": Detailed information.

    • "ID": Information with only IDs.

    • "META": Metadata information.

    • "SUMMARY": Summary information (default).

Returns:

A DataFrame containing molecular data.

Return type:

pandas.DataFrame
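The sample_molecular_identifiers argument pairs each Molecular Profile ID with Sample IDs; the Examples section of this page passes it as a list of dicts with molecular_profile_id and sample_ids keys. When the starting point is a flat list of (profile, sample) pairs, that structure can be assembled as sketched below. The helper group_sample_identifiers is a hypothetical name, not a pybioportal function.

```python
from collections import defaultdict


def group_sample_identifiers(pairs):
    """Group (molecular_profile_id, sample_id) pairs into a list of dicts.

    Hypothetical helper, not part of pybioportal. The dict keys follow the
    structure used in the Examples section of this page.
    """
    grouped = defaultdict(list)
    for profile_id, sample_id in pairs:
        # Skip duplicates so each sample appears once per profile.
        if sample_id not in grouped[profile_id]:
            grouped[profile_id].append(sample_id)
    return [
        {"molecular_profile_id": profile_id, "sample_ids": sample_ids}
        for profile_id, sample_ids in grouped.items()
    ]


pairs = [
    ("brca_tcga_mrna", "TCGA-AR-A1AR-01"),
    ("acc_tcga_rna_seq_v2_mrna", "TCGA-OR-A5J1-01"),
    ("brca_tcga_mrna", "TCGA-BH-A1EO-01"),
]
identifiers = group_sample_identifiers(pairs)
print(identifiers)
```

The resulting list could be passed as sample_molecular_identifiers=identifiers to fetch_molecular_data.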

pybioportal.molecular_data.get_all_molecular_data_in_molecular_profile(molecular_profile_id, sample_list_id, entrez_gene_id, projection='SUMMARY')

Get all molecular data in a molecular profile for a specific gene.

Parameters:
  • molecular_profile_id (str) – Molecular Profile ID (e.g., "acc_tcga_rna_seq_v2_mrna").

  • sample_list_id (str) – Sample List ID (e.g., "acc_tcga_all").

  • entrez_gene_id (str) – Entrez Gene ID (e.g., "1").

  • projection (str) –

    Level of detail of the response.

    Possible values:

    • "DETAILED": Detailed information.

    • "ID": Information with only IDs.

    • "META": Metadata information.

    • "SUMMARY": Summary information (default).

Returns:

A DataFrame containing molecular data for the specified gene.

Return type:

pandas.DataFrame


Examples

from pybioportal import molecular_data as md
df1a = md.fetch_molecular_data(entrez_gene_ids=["672","675"],
                               molecular_profile_ids=["brca_tcga_mrna", "acc_tcga_rna_seq_v2_mrna"])
df1a
uniqueSampleKey uniquePatientKey entrezGeneId molecularProfileId sampleId patientId studyId value
0 VENHQS1PUi1BNUoxLTAxOmFjY190Y2dh VENHQS1PUi1BNUoxOmFjY190Y2dh 672 acc_tcga_rna_seq_v2_mrna TCGA-OR-A5J1-01 TCGA-OR-A5J1 acc_tcga 40.345800
1 VENHQS1PUi1BNUoxLTAxOmFjY190Y2dh VENHQS1PUi1BNUoxOmFjY190Y2dh 675 acc_tcga_rna_seq_v2_mrna TCGA-OR-A5J1-01 TCGA-OR-A5J1 acc_tcga 15.850100
2 VENHQS1PUi1BNUoyLTAxOmFjY190Y2dh VENHQS1PUi1BNUoyOmFjY190Y2dh 672 acc_tcga_rna_seq_v2_mrna TCGA-OR-A5J2-01 TCGA-OR-A5J2 acc_tcga 177.560000
3 VENHQS1PUi1BNUoyLTAxOmFjY190Y2dh VENHQS1PUi1BNUoyOmFjY190Y2dh 675 acc_tcga_rna_seq_v2_mrna TCGA-OR-A5J2-01 TCGA-OR-A5J2 acc_tcga 29.392000
4 VENHQS1PUi1BNUozLTAxOmFjY190Y2dh VENHQS1PUi1BNUozOmFjY190Y2dh 672 acc_tcga_rna_seq_v2_mrna TCGA-OR-A5J3-01 TCGA-OR-A5J3 acc_tcga 47.992900
... ... ... ... ... ... ... ... ...
1211 VENHQS1FMi1BMTVBLTA2OmJyY2FfdGNnYQ VENHQS1FMi1BMTVBOmJyY2FfdGNnYQ 675 brca_tcga_mrna TCGA-E2-A15A-06 TCGA-E2-A15A brca_tcga -1.171750
1212 VENHQS1FMi1BMTVLLTA2OmJyY2FfdGNnYQ VENHQS1FMi1BMTVLOmJyY2FfdGNnYQ 672 brca_tcga_mrna TCGA-E2-A15K-06 TCGA-E2-A15K brca_tcga -0.918167
1213 VENHQS1FMi1BMTVLLTA2OmJyY2FfdGNnYQ VENHQS1FMi1BMTVLOmJyY2FfdGNnYQ 675 brca_tcga_mrna TCGA-E2-A15K-06 TCGA-E2-A15K brca_tcga -0.193000
1214 VENHQS1FMi1BMTVFLTA2OmJyY2FfdGNnYQ VENHQS1FMi1BMTVFOmJyY2FfdGNnYQ 672 brca_tcga_mrna TCGA-E2-A15E-06 TCGA-E2-A15E brca_tcga -1.898417
1215 VENHQS1FMi1BMTVFLTA2OmJyY2FfdGNnYQ VENHQS1FMi1BMTVFOmJyY2FfdGNnYQ 675 brca_tcga_mrna TCGA-E2-A15E-06 TCGA-E2-A15E brca_tcga -2.228000

1216 rows × 8 columns
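The uniqueSampleKey and uniquePatientKey values in the output above look like URL-safe Base64 strings with the trailing "=" padding removed. This page does not document their format, so the sketch below is an observation drawn from the sample rows rather than a guarantee: decoding them yields the sample or patient ID joined to the study ID with a colon. The helper decode_unique_key is a hypothetical name.

```python
import base64


def decode_unique_key(key):
    """Decode a URL-safe Base64 key, restoring any stripped '=' padding first."""
    padded = key + "=" * (-len(key) % 4)
    return base64.urlsafe_b64decode(padded).decode("utf-8")


# uniqueSampleKey from row 0 and uniquePatientKey from row 1211 of the output above.
print(decode_unique_key("VENHQS1PUi1BNUoxLTAxOmFjY190Y2dh"))  # TCGA-OR-A5J1-01:acc_tcga
print(decode_unique_key("VENHQS1FMi1BMTVBOmJyY2FfdGNnYQ"))    # TCGA-E2-A15A:brca_tcga
```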

df1b = md.fetch_molecular_data(entrez_gene_ids=["672","675"],
                               sample_molecular_identifiers=[
                                         {"molecular_profile_id": "brca_tcga_mrna",
                                          "sample_ids": ["TCGA-AR-A1AR-01","TCGA-BH-A1EO-01"]},
                                         {"molecular_profile_id": "acc_tcga_rna_seq_v2_mrna",
                                          "sample_ids": ["TCGA-OR-A5J1-01","TCGA-OR-A5J2"]}
                                         ])
df1b
uniqueSampleKey uniquePatientKey entrezGeneId molecularProfileId sampleId patientId studyId value
0 VENHQS1BUi1BMUFSLTAxOmJyY2FfdGNnYQ VENHQS1BUi1BMUFSOmJyY2FfdGNnYQ 672 brca_tcga_mrna TCGA-AR-A1AR-01 TCGA-AR-A1AR brca_tcga -1.224333
1 VENHQS1BUi1BMUFSLTAxOmJyY2FfdGNnYQ VENHQS1BUi1BMUFSOmJyY2FfdGNnYQ 675 brca_tcga_mrna TCGA-AR-A1AR-01 TCGA-AR-A1AR brca_tcga 0.027250
2 VENHQS1CSC1BMUVPLTAxOmJyY2FfdGNnYQ VENHQS1CSC1BMUVPOmJyY2FfdGNnYQ 672 brca_tcga_mrna TCGA-BH-A1EO-01 TCGA-BH-A1EO brca_tcga -1.739417
3 VENHQS1CSC1BMUVPLTAxOmJyY2FfdGNnYQ VENHQS1CSC1BMUVPOmJyY2FfdGNnYQ 675 brca_tcga_mrna TCGA-BH-A1EO-01 TCGA-BH-A1EO brca_tcga -1.380500
4 VENHQS1PUi1BNUoxLTAxOmFjY190Y2dh VENHQS1PUi1BNUoxOmFjY190Y2dh 672 acc_tcga_rna_seq_v2_mrna TCGA-OR-A5J1-01 TCGA-OR-A5J1 acc_tcga 40.345800
5 VENHQS1PUi1BNUoxLTAxOmFjY190Y2dh VENHQS1PUi1BNUoxOmFjY190Y2dh 675 acc_tcga_rna_seq_v2_mrna TCGA-OR-A5J1-01 TCGA-OR-A5J1 acc_tcga 15.850100

df2 = md.get_all_molecular_data_in_molecular_profile(molecular_profile_id="brca_tcga_rppa",
                                                     sample_list_id="brca_tcga_all",
                                                     entrez_gene_id="675")
df2
uniqueSampleKey uniquePatientKey entrezGeneId molecularProfileId sampleId patientId studyId value
0 VENHQS0zQy1BQUxJLTAxOmJyY2FfdGNnYQ VENHQS0zQy1BQUxJOmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-3C-AALI-01 TCGA-3C-AALI brca_tcga -0.181730
1 VENHQS0zQy1BQUxLLTAxOmJyY2FfdGNnYQ VENHQS0zQy1BQUxLOmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-3C-AALK-01 TCGA-3C-AALK brca_tcga -0.366790
2 VENHQS00SC1BQUFLLTAxOmJyY2FfdGNnYQ VENHQS00SC1BQUFLOmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-4H-AAAK-01 TCGA-4H-AAAK brca_tcga -0.108050
3 VENHQS01TC1BQVQxLTAxOmJyY2FfdGNnYQ VENHQS01TC1BQVQxOmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-5L-AAT1-01 TCGA-5L-AAT1 brca_tcga -0.785290
4 VENHQS01VC1BOVFBLTAxOmJyY2FfdGNnYQ VENHQS01VC1BOVFBOmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-5T-A9QA-01 TCGA-5T-A9QA brca_tcga -0.469280
... ... ... ... ... ... ... ... ...
887 VENHQS1XOC1BODZHLTAxOmJyY2FfdGNnYQ VENHQS1XOC1BODZHOmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-W8-A86G-01 TCGA-W8-A86G brca_tcga -0.083261
888 VENHQS1XVC1BQjQxLTAxOmJyY2FfdGNnYQ VENHQS1XVC1BQjQxOmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-WT-AB41-01 TCGA-WT-AB41 brca_tcga -0.386950
889 VENHQS1YWC1BODk5LTAxOmJyY2FfdGNnYQ VENHQS1YWC1BODk5OmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-XX-A899-01 TCGA-XX-A899 brca_tcga -0.472410
890 VENHQS1aNy1BOFI1LTAxOmJyY2FfdGNnYQ VENHQS1aNy1BOFI1OmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-Z7-A8R5-01 TCGA-Z7-A8R5 brca_tcga -0.458130
891 VENHQS1aNy1BOFI2LTAxOmJyY2FfdGNnYQ VENHQS1aNy1BOFI2OmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-Z7-A8R6-01 TCGA-Z7-A8R6 brca_tcga -0.382420

892 rows × 8 columns
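The results on this page are in long format, with one row per sample and gene. For side-by-side comparison of genes it can help to reshape them so each sample maps to its per-gene values. The sketch below does this in plain Python on four brca_tcga_mrna rows copied from the df1b output; the function pivot_by_sample is a hypothetical name, and storing entrezGeneId as an integer is an assumption made for this sketch.

```python
# Rows copied from the df1b output (brca_tcga_mrna only), reduced to three columns.
# Assumption: entrezGeneId is represented as an int in this sketch.
rows = [
    {"sampleId": "TCGA-AR-A1AR-01", "entrezGeneId": 672, "value": -1.224333},
    {"sampleId": "TCGA-AR-A1AR-01", "entrezGeneId": 675, "value": 0.027250},
    {"sampleId": "TCGA-BH-A1EO-01", "entrezGeneId": 672, "value": -1.739417},
    {"sampleId": "TCGA-BH-A1EO-01", "entrezGeneId": 675, "value": -1.380500},
]


def pivot_by_sample(rows):
    """Return {sampleId: {entrezGeneId: value}} from long-format rows."""
    wide = {}
    for row in rows:
        wide.setdefault(row["sampleId"], {})[row["entrezGeneId"]] = row["value"]
    return wide


wide = pivot_by_sample(rows)
for sample_id, values in wide.items():
    print(sample_id, values)
```

For a DataFrame restricted to a single molecular profile, the pandas equivalent would be df.pivot(index="sampleId", columns="entrezGeneId", values="value").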

df3 = md.fetch_all_molecular_data_in_molecular_profile(molecular_profile_id="brca_tcga_rppa",
                                                       entrez_gene_ids=["672", "675"],
                                                       sample_ids=["TCGA-AR-A1AR-01", "TCGA-BH-A1EO-01"])
df3
uniqueSampleKey uniquePatientKey entrezGeneId molecularProfileId sampleId patientId studyId value
0 VENHQS1CSC1BMUVPLTAxOmJyY2FfdGNnYQ VENHQS1CSC1BMUVPOmJyY2FfdGNnYQ 675 brca_tcga_rppa TCGA-BH-A1EO-01 TCGA-BH-A1EO brca_tcga -0.40763