Documentation request, import formats for TPM_txt files

I have a TPM file that I am importing in my colab script using pandas and scanpy. The txt file starts with 1 row X 55737 columns or another way around. Scanpy/panda is considering the first-row value as a heading. How to read TXT?
I have tried multiple ways to read the files such as

data1 = pd.read_csv(GSE120575_Sade_Feldman_melanoma_single_cells_TPM_GEO.txt’, sept=’\t’)
1 rows × 55737 columns
test1 = pd.read_csv(‘GSE120575_Sade_Feldman_melanoma_single_cells_TPM_GEO.txt’)

adata = sc.read_text(GSE120575_Sade_Feldman_melanoma_single_cells_TPM_GEO.txt).transpose()

adata = sc.read(GSE120575_Sade_Feldman_melanoma_single_cells_TPM_GEO.txt, ext='txt').transpose() 

Warning: Total number of columns (55737) exceeds max_columns (20) limiting to first (20) columns.

same as this file formate too “PP001swap.filtered.matrix.txt”
the file was downloaded from the below links :

  1. GEO Accession viewer
  2. GEO Accession viewer
    I really appreciate your help.

I am very new to bioinformatics/scanpy

Hi,

The file you are trying to read in the screenshot is a ‘tab separated values’ (TSV) file. The ‘borders’ between entries in the table are separated by tab characters ('\t'), while the pd.read_csv() function assumes that entries are separated by comma characters (','). So for the pd.read_csv() function it looks like there is just one table entry on each row. You can tell pd.read_csv() to separate values by tab characters instead by doing e.g. pd.read_csv(file_path, sep = '\t').

/Valentine

1 Like

Thank you @Valentine_Svensson