Problems using parameters in a STARTed Batch file

Discussion forum for all Windows batch related topics.

Moderator: DosItHelp

Post Reply
Message
Author
Aacini
Expert
Posts: 1921
Joined: 06 Dec 2011 22:15
Location: México City, México
Contact:

Problems using parameters in a STARTed Batch file

#1 Post by Aacini » 05 Jul 2024 07:25

I discovered a couple problems when I want to use special and extended characters in parameters in a Batch file started with START command. This is the test.bat Batch file:

Code: Select all

@echo %*
... that works perfectly when it is used this way:

Code: Select all

start "Title" test.bat First Second
... but I want to use special characters in the parameters, so I need to enclose they in quotes:

Code: Select all

start "Title" test.bat "First >" "Second |"
Ok. Now, I need to enclose the Batch file name in quotes because its path will contain spaces. Lets start with a test with no special characters first:

Code: Select all

start "Title" "test.bat" First Second
... that works OK. However, if I enclose the parameters in quotes (because they will contain special characters) the problems begins. This command:

Code: Select all

start "Title" "test.bat" "First" "Second"
... show this output in the started cmd.exe window:

Code: Select all

"test.bat"  "First" "Second" is not recognized as an external or internal command, etc...
After several tests, the only way I found to fix this point is include a CMD /K command after the START command AND insert an additional pair of quotes at the very beginning and end of the whole command, that is:

Code: Select all

start "Title" cmd /K ""test.bat" "First" "Second""
... that correctly works again.

However, if I include the special characters in the parameters:

Code: Select all

start "Title" cmd /K ""test.bat" "First >" "Second |""
... the problems returns... :(

After several tests I discovered that the only way to include special characters in the parameters in this way is escaping the special characters:

Code: Select all

start "Title" cmd /K ""test.bat" "First ^>" "Second ^|""
... that is not a convenient solution. :?

:arrow: First question: is there any better way to include special characters in the parameters of a Batch file started this way?

Ok. If the problem is the characters included in the parameters, then just eliminate the parameters. I changed the test.bat file for this line:

Code: Select all

@echo %parameters%
... and executed it this way:

Code: Select all

set parameters="First >" "Second |"
start "Title" cmd /K "test.bat"
... that correctly show "First >" "Second |" in the started cmd.exe window.

===================================================================================

Ok. Lets pass now to the second problem. I want that the parameters include extended (beyond ASCII 127) characters:

Code: Select all

set parameters="ÇüéâäàåçêëèïîìÄÅ" "áíóúñÑ¿¡"
start "Title" cmd /K "test.bat"
This test correctly show the extended characters in the started cmd.exe window. However, this test was executed from the keyboard. Of course, I want to use the test.bat file from another Batch file, so I prepared it:

Code: Select all

@echo off
set parameters="ÇüéâäàåçêëèïîìÄÅ" "áíóúñÑ¿¡"
start "Title" cmd /K "test.bat"
When I executed this second.bat file the started cmd.exe window show characters, but the characters are different:

Code: Select all

"├ç├╝├®├ó├ñ├á├Ñ├º├¬├½├¿├»├«├¼├ä├à" "├í├¡├│├║├▒├æ┬┐┬í"
I thought at first that I found the problem: both test.bat and second.bat were saved with the default Windows Notepad encoding: UTF-8. This point had caused me several problems in the past...

So I saved both files with ANSI encoding and checked that the characters in second.bat file be correct. Anyway, the characters shown ARE NOT THE SAME! :shock:

Code: Select all

"óÚÔõÓÕþÛÙÞ´¯ý─┼" "ßݾ·±Ð┐í"
:arrow: What happens here? How can I show the same extended characters of the calling Batch file in the called one?

Antonio

aGerman
Expert
Posts: 4684
Joined: 22 Jan 2010 18:01
Location: Germany

Re: Problems using parameters in a STARTed Batch file

#2 Post by aGerman » 05 Jul 2024 14:22

Hi Antonio.

1) There is a double parsing in place.
The caller parses the the quotes in the line as pairs. Only the characters within a pair are secured (if any :wink:).
start "title" cmd /c ""script.bat" "arg1""

The syntax of the CMD /K (or /C) execution however requires the additional outer pair of quotes. The quoted tokens inside are still secured. (FWIW This proves that the cmd has a homebrew command line parsing.)
cmd /c ""script.bat" "arg1""

So, this conflicts. The solution is to only escape the outer quotes for the caller process. (The carets don't reach out to the second cmd process.)
start "title" cmd /c ^""script.bat" "arg1"^"


2) Actually the only way to process non-ASCII characters in a portable way in batch is to use UTF-8 (without BOM) everywhere. This "everywhere" means that you not only have to save the script UTF-8 encoded, but also that you have to tell the conhost process that it has to parse the whole I/O UTF-8 encoded. The latter only requires a
>NUL CHCP 65001
at the early beginning of the scripts.


caller.bat (UTF-8 no BOM)

Code: Select all

@echo off
>nul chcp 65001
start "Title" cmd /K ^""test.bat" "First >" "Second |" "ÇüéâäàåçêëèïîìÄÅ" "áíóúñÑ¿¡"^"

test.bat (UTF-8 no BOM)

Code: Select all

@echo off
>nul chcp 65001
echo %*
Steffen

Aacini
Expert
Posts: 1921
Joined: 06 Dec 2011 22:15
Location: México City, México
Contact:

Re: Problems using parameters in a STARTed Batch file

#3 Post by Aacini » 06 Jul 2024 21:37

Wow! Both solutions worked as intended! :D

The escaped additional quotes are simple and works correctly.

To correctly manage the extended characters I must include a CHCP 65001 command AND save the Batch files with UTF-8 encoding. If one of these points is not completed, the method fails...

Thanks a lot! :wink:

Antonio

aGerman
Expert
Posts: 4684
Joined: 22 Jan 2010 18:01
Location: Germany

Re: Problems using parameters in a STARTed Batch file

#4 Post by aGerman » 07 Jul 2024 11:13

It's been always required that both the encoding of the script and the code page setting of the conhost process are the same.
Remember the days where we have been used to writing something like ...

Code: Select all

@echo off

echo ÚÄÄÂÄÄ¿
echo ³  ³  ³
echo ÃÄÄÅÄÄ´
echo ³  ³  ³
echo ÀÄÄÁÄÄÙ
pause
... to draw boxes?
It took me quite a while to grasp the full extent of the fundamental wrongness of it.
Notepad and other editors didn't allow us to save the script in the default OEM code page. So we used the ANSI code page with characters having the same byte values as the box-drawing characters in the OEM code page. Thus, it was actually just a way to work around getting an OEM CP encoded script. However, this did only work for those people where the ANSI code page defaults to Windows-1252 and the OEM code page is at least a single-byte encoding (e.g. Japanese with CP 932 whould have had bad luck entirely). Writing the box drawing characters directly in the script and change the code page of conhost to CP 1252 is not possible just because those characters are not part of CP 1252. Conclusion is that the whole single byte code page stuff is crap and makes it a mess.
We have been also almost unable to do the right things though. At least not in a simple way. We should have taken a Unicode compliant encoding from the beginning. But typically editors on Windows prepended a BOM to a UTF-8 file that the command processor is still unable to handle/ignore.
However, at long last it became much easier and far more plausible now by just writing what we want to get printed ...

Code: Select all

@echo off
>nul chcp 65001
echo ┌──┬──┐
echo │  │  │
echo ├──┼──┤
echo │  │  │
echo └──┴──┘
echo 😉
pause
... and save the code UTF-8 encoded (without BOM). The emoji at the end is not a mistake. On recent Windows versions (>= Win10) we have a decent Unicode support in both console and terminal. We just need to jump on the bandwagon and make use of it :D Of course, you can even pass emojis to another script, as long as you align the encoding to UTF-8 throughout the whole chain:

Code: Select all

@echo off
>nul chcp 65001
start "Title" cmd /K ^""test.bat" "👍"^"
Steffen

einstein1969
Expert
Posts: 961
Joined: 15 Jun 2012 13:16
Location: Italy, Rome

Re: Problems using parameters in a STARTed Batch file

#5 Post by einstein1969 » 16 Jul 2024 14:30

damn I was expecting to see the emoji in the cmd window, but it didn't appear...

I used the same font as the editor, notepad, consolas, you can see them

its normal?
Attachments
Nuova immagine bitmap.jpg
Nuova immagine bitmap.jpg (42.83 KiB) Viewed 44909 times
Nuova immagine bitmap.jpg
Nuova immagine bitmap.jpg (48.91 KiB) Viewed 44909 times

miskox
Posts: 634
Joined: 28 Jun 2010 03:46

Re: Problems using parameters in a STARTed Batch file

#6 Post by miskox » 16 Jul 2024 14:40

Missed part of this thread... I don't see it either:

Code: Select all

@echo off
>nul chcp 65001
echo ┌──┬──┐
echo │  │  │
echo ├──┼──┤
echo │😉│  │
echo └──┴──┘
pause
Saso

aGerman
Expert
Posts: 4684
Joined: 22 Jan 2010 18:01
Location: Germany

Re: Problems using parameters in a STARTed Batch file

#7 Post by aGerman » 16 Jul 2024 15:50

The console/terminal team is permanently about to improve Unicode support. I defintely don't expect it to work on any version older than Win 10. But I actually thought that MS still backports new features to Win 10. I tested on Win 11 and don't have even access to a Win 10 machine anymore.

Console
Screenshot 2024-07-16 234513.png
Screenshot 2024-07-16 234513.png (11.65 KiB) Viewed 44700 times

Windows Terminal
Screenshot 2024-07-16 234400.png
Screenshot 2024-07-16 234400.png (118.96 KiB) Viewed 44700 times

Might also depend on the abilities of the font. "Cascadia Code" in my case.

Steffen

einstein1969
Expert
Posts: 961
Joined: 15 Jun 2012 13:16
Location: Italy, Rome

Re: Problems using parameters in a STARTed Batch file

#8 Post by einstein1969 » 16 Jul 2024 16:14

It's definitely because I'm on Windows 10. Anyway, thanks

miskox
Posts: 634
Joined: 28 Jun 2010 03:46

Re: Problems using parameters in a STARTed Batch file

#9 Post by miskox » 17 Jul 2024 04:50

I am on Win10 too.

But I am not moving to Win 11 just to see this smile.

Saso

aGerman
Expert
Posts: 4684
Joined: 22 Jan 2010 18:01
Location: Germany

Re: Problems using parameters in a STARTed Batch file

#10 Post by aGerman » 17 Jul 2024 10:00

For sure not because of a single smiley :lol:
It may still take a veeeery long time before we get close to the theoretical limit of 1114112 assigned Unicode code points. The majority is still not assigned. However, this number is 4352 times as many as conceivable in a single-byte encoding. Just to give a little indication about the difference it makes. Besides of all kinds of scripts and emojis, it contains mathematical symbols, shapes, dingbats etc.
The biggest advantage is that we don't have to rely on the right local settings. You can have Latin, Cyrillic, Greek, Chinese along with box drawings, blocks, ... all at the same time. So, ultimately we get what is state of the art on web sites for quite a long time (and the reason why we all see the same things here in the forum regardless where we are from). HTML defaults to UTF-8 if not explicitly stated otherwise. And now MS is working on making UTF-8 the standard for user interfaces (as it is a quasi standard on Unix-like operating systems already).

Steffen

miskox
Posts: 634
Joined: 28 Jun 2010 03:46

Re: Problems using parameters in a STARTed Batch file

#11 Post by miskox » 17 Jul 2024 11:53

I completely agree with you Steffen that this is going the right direction. But to have a .bat written for Windows 11 and majority of the users of this .bat are still on Windows 10 delays upgrade.

Thank you.
Saso

Post Reply