module12=pd.read_table(‘/Users/mojaeu/Desktop/303_length_genes.txt’) #only gene names
len(module12)=303
ya12=list(module12.Gene.values)
wcc=set(data_lung.var_names).intersection(ya12)
data_lung_edit = data_lung[:,list(wcc)]
data_lung_edit.var_names
len(data_lung_edit.var_names )=181
The number of genes I want to use for analysis is 303, but if you use the code above to find an intersection, only 181 genes are used for analysis. (seek an intersection with adata 9999 genes)
But, genes that are not included in the intersection are expressed in UMAP.(ex…CDH1 gene)
May I know why this is happening?
Thanks.