How to properly extract data from text file. (2024)

1 view (last 30 days)

Show older comments

Sharah on 11 May 2017

Link

Direct link to this question

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file

⋮

Link

Direct link to this question

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file

Commented: Guillaume on 12 May 2017

Accepted Answer: dpb

Open in MATLAB Online

I have a data in a text file that looks basically like this:

LineType: 1

PlayMode: Single

GameType: OneBalloon

LineType: SumR3

TranslationSpeed: 0

SensivityBalloon1: 0.09

SensivityBalloon2: 0

LevelLength: 20

Season: Summer

Backgrounddifficulty: Easy

StarScore[1] DistanceScore[1] StabilityScore[1] ScoreFrames[1] Frame[1] Time[1] ForcePlayer1[1] BalloonPath_X[1] BalloonPath_Y[1] CharacterPath_X[1] CharacterPath_Y[1] IsInactive[1]

0 0 0 0 0 0 30653 0 4.225888 0 2.150741 0

1 0 0 0 1 0 30641 0 -2.579402 0 -4.643577 0

And I am using this to extract data starting from the StarScore:

file = fullfile('file.txt');

Subject(1).T = readtable(file,'Delimiter',' ', ...

'ReadVariableNames',true, 'HeaderLines', 10);

Subject(1).T(:, 13) = [];

Two questions I have:

1) The problem with this is that, the headerline should be at 11, but MATLAB extracted the first data as the header if I put HeaderLines to 11. It skips the first line. Why?

2) How to extract the first few information from the text file on a different cell and stop before it reaches starScore?

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

dpb on 11 May 2017

Link

Direct link to this answer

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#answer_266694

⋮

Link

Direct link to this answer

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#answer_266694

Edited: dpb on 12 May 2017

Open in MATLAB Online

6 Comments
Show 4 older commentsHide 4 older comments

Sharah on 12 May 2017

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453314

⋮

Link

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453314

Open in MATLAB Online

You do have any idea why when I do:

Subject(1).T = readtable(file,'Delimiter',' ', ...

'ReadVariableNames',true, 'HeaderLines', 10);

there's a 13th column of data of NaN? I only have 12 data. I need to put

Subject(1).T(:, 13) = [];

in order to get rid of this.

dpb on 12 May 2017

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453320

⋮

Link

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453320

Open in MATLAB Online

That appears to be a fignewton of the fact there is an extra blank at the end of the data lines and the delimiter is blank as well.

Removing that from the header line and all data lines fixed it; if miss either then get mismatched length error.

Error using readtable (line 129)

Cannot interpret data in the file 'sarah.dat'.

Found 12 variable names but 13 data columns. ...

Guillaume on 12 May 2017

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453331

⋮

Link

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453331

Adding the option 'MultipleDelimsAsOne', true to the readtable call would probably fix the 13th column issue.

Sharah on 12 May 2017

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453335

⋮

Link

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453335

Edited: Sharah on 12 May 2017

'MultipleDelimsAsOne' to true does not fix the issue. This is quite weird as if I use 'textscan' this won't happen. I need to use the 'readtable' function in this case as I need the original header from the text file (as the arrangement of data is not fixed, i.e. sometime Time will come in first column'

dpb on 12 May 2017

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453362

⋮

Link

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453362

Edited: dpb on 12 May 2017

Open in MATLAB Online

No, because there aren't two blanks trailing but one (and even if were, they would be reduced from whatever number there were to just same one).

The "problem" is there is a 12 th delimiter in the record which indicates there are 13 fields; there just is no subsequent data for that field. It's no different than a .csv file being terminated by a trailing ','.

I see same effect in textscan --

>> T=textscan(fid,'','Delimiter',' ', ...

'headerlines',11,'collectoutput',1)

T =

[2x13 double]

Since the file is malformed, either

fix it (eliminate the trailing blank), or
clean up the input after read it.

Given it's such a trivial fix to eliminate column that is all NaN, that's probably the simplest thing to do.

ADDENDUM

Actually, the symptom is pretty common and comes from something like

[nr,nc]=size(array); % size of array to write to file

fmt=[repmat('%f ',1,nc) '\n']; % nc fields and newline

fprintf(fid,array.') % write the array

While that seems superficially "the right stuff", by using the total number of columns in the repmat call on the numeric fields with the blank delimiter after each, you have created the Frankenstein you're now trying to read of the extra delimiter.

It's trivial to fix, however, assuming you have control over creating the file--if somebody else did it, then you have to have them fix it for you or deal with it after the fact, unfortunately.

All that's need is to just modify the format string just a wee bit...

fmt=[repmat('%f ',1,nc-1) '%f\n'];

where the delimiter is written for all the columns except the last, whose field is just the field string followed directly by the newline.

Guillaume on 12 May 2017

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453482

⋮

Link

Direct link to this comment

https://www.mathworks.ru/matlabcentral/answers/339849-how-to-properly-extract-data-from-text-file#comment_453482

Open in MATLAB Online

Alternatively,

fmt = strjoin(repmat({'%f'}, 1, nc), ' ');

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

An Error Occurred

Unable to complete the action because of changes made to the page. Reload the page to see its updated state.

Select a Web Site

Choose a web site to get translated content where available and see local events and offers. Based on your location, we recommend that you select: .

You can also select a web site from the following list

Americas

América Latina (Español)
Canada (English)
United States (English)

Europe

Belgium (English)
Denmark (English)
Deutschland (Deutsch)
España (Español)
Finland (English)
France (Français)
Ireland (English)
Italia (Italiano)
Luxembourg (English)

Netherlands (English)
Norway (English)
Österreich (Deutsch)
Portugal (English)
Sweden (English)
Switzerland
United Kingdom(English)

Asia Pacific

Australia (English)
India (English)
New Zealand (English)
中国
- 简体中文Chinese
- English
日本Japanese (日本語)
한국Korean (한국어)

Contact your local office

How to properly extract data from text file. (2024)

Direct link to this question

Direct link to this question

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

Direct link to this answer

Direct link to this answer

6 Comments
Show 4 older commentsHide 4 older comments

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

More Answers (1)

Direct link to this answer

Direct link to this answer

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

References

How to properly extract data from text file. (2024)

Direct link to this question

Direct link to this question

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

Direct link to this answer

Direct link to this answer

6 Comments Show 4 older commentsHide 4 older comments

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

Direct link to this comment

More Answers (1)

Direct link to this answer

Direct link to this answer

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

References

0 Comments
Show -2 older commentsHide -2 older comments

6 Comments
Show 4 older commentsHide 4 older comments

0 Comments
Show -2 older commentsHide -2 older comments