#Linux : extract substring in #Bash

One of the most important tasks in dealing with data is string manipulation. We already saw how to use awk and grep to efficiently sift through text files using command line tools instead of developing ad-hoc code. To step it up a notch, we can also do some heavier preprocessing of the data, such as selecting only the subset of information that matches a particular pattern, to ensure data coming out of our pipeline is of good quality.

In this case, we use a Bash feature called parameter expansion. Let’s assume we have the text data in a variable TEXT_LINE and an expression pattern to match (in file-name matching format), this is a summary of the possible expansion:

Delete shortest match of pattern from the beginning
${TEXT_LINE#pattern}
Delete longest match of pattern from the beginning
${TEXT_LINE##pattern}
Delete shortest match of pattern from the end
${TEXT_LINE%pattern}
Delete longest match of pattern from the end
${TEXT_LINE%%pattern}
Get substrings based on position using numbers
${TEXT_LINE:START:END}
Replace particular strings or patterns
${TEXT_LINE/pattern/replace}

So for example, to extract only the file name without the extension:

${TEXT_LINE%.*}

or to extract user name from an email:

${TEXT_LINE%%@*.*}

or extract the file name from an absolute path:

${TEXT_LINE##*/}

NOTE: You can’t combine two operations, instead you have to assign to an intermediate variable.

About whitehatty

Computer Engineer and Scientist interested in Computer Security, Complex Networks, Math, Biology and Medicine. "Think Different" life style. Quake 3 Arena player. NERD by DNA.

View all posts by whitehatty »

Posted on February 11, 2019, in Linux and tagged command line tools, Data Processing, linux, script bash. Bookmark the permalink. Leave a comment.

Leave a comment
Comments 0

	Keith on #WP7 : Enable Hidden MFG …
	Andrea on #MacOSX : Make your Mac a Wire…
	Miles on #MacOsX : Disable Auto-Save an…
	Net Eng on #MacOSX : Make your Mac a Wire…

whitehatty

#TheOnlyLimitIsTheOneYouSetYourself

#Linux : extract substring in #Bash

About whitehatty

Leave a comment

Comments 0

Leave a comment Cancel reply

Top Posts & Pages

Recent Comments

Categories

Random

whitehatty

#TheOnlyLimitIsTheOneYouSetYourself

#Linux : extract substring in #Bash

Share The Knowledge:

Related

About whitehatty

Leave a comment

Comments 0

Leave a comment Cancel reply

Top Posts & Pages

Recent Comments

Categories

Random