Modify and extract information from large text files (16/10/2019)

This training session is organized by the Genotoul bioinfo platform and aims at understanding how to efficiently process large raw or result text files. You will learn how to add a header line or change parts of a file without having the make a copy. You will learn how to extract columns and perform simple figure manipulations on files with millions of lines. A large share of the time will be spent practicing with sed and awk.



This training is focused on practice. It consists of 3 modules with a large variety of exercises:
You are going to learn how to manipulate large file.

  • 09:00 am to 10:30 am : Regular expressions – TP1
  • 10:45 am to 12:30 pm : File editing with sed – TP2
  • 14:00 pm to 17:00 pm : Extracting information with awk – Combining unix sed and awk within pipes – TP3


The session will take place in the room ‘salle de formation’ at INRA center of Toulouse-Auzeville.


Prerequisites: ability to use a Linux environment (see Linux training). Training materials will be available on the website before the session. Slides in a “taking notes” format will be downloadable from our web site. A Unix reference command leaflet will also be provided. Only the latter will be available during the session.




Bookings: Modify and extract information from large text files

This event is fully booked.