ChatGPT解决这个技术问题 Extra ChatGPT

In C#, should I use string.Empty or String.Empty or "" to intitialize a string?

In C#, I want to initialize a string value with an empty string.

How should I do this? What is the right way, and why?

string willi = string.Empty;

or

string willi = String.Empty;

or

string willi = "";

or what?

See also this similar discussion for java: stackoverflow.com/questions/213985/…
better to use String.IsNullOrEmpty(string myString) though, surely?
I use [string.IsNullOrWhiteSpace(stringvalue)] ... works in .Net 4.0. To initialize, I simply use: [var text = "";] Simple, readable and takes the least time to type :)
What's more important is the hilarious name of your variable.
The thing that has interested me, is why is there even an Empty property. It is nice and everything, but not a necessary and a complete must.

O
Otiel

Use whatever you and your team find the most readable.

Other answers have suggested that a new string is created every time you use "". This is not true - due to string interning, it will be created either once per assembly or once per AppDomain (or possibly once for the whole process - not sure on that front). This difference is negligible - massively, massively insignificant.

Which you find more readable is a different matter, however. It's subjective and will vary from person to person - so I suggest you find out what most people on your team like, and all go with that for consistency. Personally I find "" easier to read.

The argument that "" and " " are easily mistaken for each other doesn't really wash with me. Unless you're using a proportional font (and I haven't worked with any developers who do) it's pretty easy to tell the difference.


Your eyes can trick you when you are expecting to see "" you can easily mistake " " for "". This is why it is easier to edit something that someone else has written. Your brain doesn't have preconceived ideas about the text so it is easier to pick out the anonmalies.
@tvanfosson: So have you (or a colleague) actually been bitten by this as a bug? I'm suspicious of this sort of claim without it actually having caused problems. I've been using "" for years without ever getting it wrong...
Personally, I've always used String.Empty, I use capital 'S' whenever I want to use a static method on string, it's just a personal prefference that allows me to distinguish a type from a variable. But this is just a carry-over from using StringUtils.EMPTY in commons.lang from java. One point of interest is I'm almost blind and this definately does help with readability for me.
You have given me the inspiration to start developing in Times New Roman.
For some obscure reason string.Empty is not a constant. That means that in a number of cases where a compile-time constant is required, string.Empty isn't even legal. This includes case "" blocks in switch statements, default values of optional parameters, parameters and properties in applying attributes, and a lot of other situations (left to the reader). So given that string.Empty is disallowed in some common situations, it is better to use the ""-everywhere convention.
B
Blorgbeard

There really is no difference from a performance and code generated standpoint. In performance testing, they went back and forth between which one was faster vs the other, and only by milliseconds.

In looking at the behind the scenes code, you really don't see any difference either. The only difference is in the IL, which string.Empty use the opcode ldsfld and "" uses the opcode ldstr, but that is only because string.Empty is static, and both instructions do the same thing. If you look at the assembly that is produced, it is exactly the same.

C# Code

private void Test1()
{
    string test1 = string.Empty;    
    string test11 = test1;
}

private void Test2()
{
    string test2 = "";    
    string test22 = test2;
}

IL Code

.method private hidebysig instance void 
          Test1() cil managed
{
  // Code size       10 (0xa)
  .maxstack  1
  .locals init ([0] string test1,
                [1] string test11)
  IL_0000:  nop
  IL_0001:  ldsfld     string [mscorlib]System.String::Empty
  IL_0006:  stloc.0
  IL_0007:  ldloc.0
  IL_0008:  stloc.1
  IL_0009:  ret
} // end of method Form1::Test1
.method private hidebysig instance void 
        Test2() cil managed
{
  // Code size       10 (0xa)
  .maxstack  1
  .locals init ([0] string test2,
                [1] string test22)
  IL_0000:  nop
  IL_0001:  ldstr      ""
  IL_0006:  stloc.0
  IL_0007:  ldloc.0
  IL_0008:  stloc.1
  IL_0009:  ret
} // end of method Form1::Test2

Assembly code

        string test1 = string.Empty;
0000003a  mov         eax,dword ptr ds:[022A102Ch] 
0000003f  mov         dword ptr [ebp-40h],eax 

        string test11 = test1;
00000042  mov         eax,dword ptr [ebp-40h] 
00000045  mov         dword ptr [ebp-44h],eax 
        string test2 = "";
0000003a  mov         eax,dword ptr ds:[022A202Ch] 
00000040  mov         dword ptr [ebp-40h],eax 

        string test22 = test2;
00000043  mov         eax,dword ptr [ebp-40h] 
00000046  mov         dword ptr [ebp-44h],eax 

@PrateekSaluja: To see the IL you can use ildasm.exe, which ships with Visual Studio. To see diassembly, use the 'Disassembly' window on the debug menu when you hit a breakpoint (works in release code too).
Hate that i am about to recommend this product.. BUT .. reflector lets you choose your language when disassembling the source and IL is an option! ILDASM is just dated feeling... MS tools team does not seem to polish or release the good tools!
K
Konrad Rudolph

The best code is no code at all:

The fundamental nature of coding is that our task, as programmers, is to recognize that every decision we make is a trade-off. […] Start with brevity. Increase the other dimensions as required by testing.

Consequently, less code is better code: Prefer "" to string.Empty or String.Empty. Those two are six times longer with no added benefit — certainly no added clarity, as they express the exact same information.


but in C# we can just say string.IsNullOrWhitespace(s) :p
I agree that code should be as small as possible but wouldn't generally argue that less characters is always better code. When it comes to variable naming for example, a reasonable amount of characters generally result in better names than just using i and j.
@Markus That highly depends: for a loop variable representing an index, i is better than a long variable name. Even more general, shorter variable names that convey the same information, in the same clarity, are always preferable. It’s just that to express the necessary information you need a certain character length, and I’m not denying this (nobody is).
@Konrad: i is a good variable name only if the loop is small and it doesn't contain any other indices. But I agree that if sth. can be stated more briefly conveying the same information, that would be preferrable, like in the string.Empty / "" case. The string.Empty does not add any clarity.
To me: string.Empty says this string is and should be Empty at all times, whereas "" says at the time of writing this string could be empty but you are free to change it.
K
Kolappan N

One difference is that if you use a switch-case syntax, you can't write case string.Empty: because it's not a constant. You get a Compilation error : A constant value is expected

Look at this link for more info: string-empty-versus-empty-quotes


The switch statement is one very good example. Also, if you make an optional parameter, like void MyMethod(string optional = "") { ... }, it's also not possible to use string.Empty. And then of course if you want to define a const field or local variable, const string myString = ""; the again "" is the only option. If only string.Empty were a constant field, there wouldn't be a difference. But it's not, so in some cases you have to use "". So why not use "" all the time?
This is a really strong argument because using string.Empty prevents you from attaining consistency in your code base: you must use two different entities to express the same thing. And to add to the list of things you can't do: you can't use string.Empty with attributes.
Very good points! The link is broken. Here's a copy of the content: web.archive.org/web/20131230161806/http://kossovsky.net/…
C
CodesInChaos

I'd prefer string to String. choosing string.Empty over "" is a matter of choosing one and sticking with it. Advantage of using string.Empty is it is very obvious what you mean, and you don't accidentally copy over non-printable characters like "\x003" in your "".


I'd argue that if you're accidentally copying non-printable characters into your code, you've got bigger problems than this question ;)
ASCII \003 happens to be a field delimiter for B2B Messages I've worked with :)
(I'd also suggest avoiding the \x escape, btw - it's too hard to spot the difference between "\x9Bad Compiler" and "\x9Good Compiler" which have radically different results!)
Personally I prefer String over string whenever calling a static method on String. I'm almost blind however and this is a personal preference that I don't enforce on anyone.
@Jimmy Sure, but we were talking about the empty string. The argument that "" is dangerous when copy/pasting is void, I claim, because you never copy/paste the empty string. For other strings, it is always something to be mindful of, of course.
B
Brian Leahy

I wasn't going to chime in, but I'm seeing some wrong info getting tossed out here.

I, personally, prefer string.Empty. That's a personal preference, and I bend to the will of whatever team I work with on a case-by-case basis.

As some others have mentioned, there is no difference at all between string.Empty and String.Empty.

Additionally, and this is a little known fact, using "" is perfectly acceptable. Every instance of "" will, in other environments, create an object. However, .NET interns its strings, so future instances will pull the same immutable string from the intern pool, and any performance hit will be negligible. Source: Brad Abrams.


I don't see why "technically" every instance of "" will create an object. It's not just chance that strings are interned - it's in the C# spec.
There is a difference between string.Empty and String.Empty. The first uses the C# language keyword string so it always works. The latter only works with using System; in the file. Prefer string over String always.
M
MidnightGun

I personally prefer "" unless there is a good reason to something more complex.


P
Palec

String.Empty and string.Empty are equivalent. String is the BCL class name; string is its C# alias (or shortcut, if you will). Same as with Int32 and int. See the docs for more examples.

As far as "" is concerned, I'm not really sure.

Personally, I always use string.Empty.


R
RBT

I performed a simple test using following method in a .NET v4.5 console application:

private static void CompareStringConstants()
{
    string str1 = "";
    string str2 = string.Empty;
    string str3 = String.Empty;
    Console.WriteLine(object.ReferenceEquals(str1, str2)); //prints True
    Console.WriteLine(object.ReferenceEquals(str2, str3)); //prints True
}

This suggests that all three variables namely str1, str2 and str3 though being initialized using different syntax are pointing to the same string (of zero length) object in memory .

So internally they have no difference. And it all boils down to convenience of which one you or your team wants to use. This behavior of string class is known as string interning in .NET Framework. Eric Lippert has a very nice blog here describing this concept.


S
Steve

This topic is pretty old and long, so excuse me if this behavior has been mentioned somewhere else. (And point me to the answer that covers this)

I have found a difference in the behavior of the compiler if you use string.Empty or double quotes. The difference shows itself if you don't use the string variable initialized with string.Empty or with double quotes.

In case of initialization with string.Empty then the Compiler Warning

CS0219 - The variable 'x' is assigned but its value is never used

is never emitted while in case of initialization with double quotes you get the expected message.

This behavior is explained in the Connect article at this link: https://connect.microsoft.com/VisualStudio/feedback/details/799810/c-warning-cs0219-not-reported-when-assign-non-constant-value

Basically, if I get it right, they want to allow a programmer to set a variable with the return value of a function for debugging purposes without bothering him with a warning message and thus they limited the warning only in case of costant assignments and string.Empty is not a constant but a field.


I believe you’re the first one to mention. I have read through this Q&A several months ago and don’t remember this difference, which I would if it had been mentioned.
Interesting. Note that a declaration var unused = "literal"; can be completely optimized away (removed) by the compiler. It can have no side effects. On the other hand, var unused = MyClass.Member; cannot be removed entirely. That is because reading Member could have side effects. If Member is a static property with a get accessor, it is clear that the call to the getter must be kept. But even if Member is a static field, there may be the side effect that the static constructor may run. Sure it would be bad coding style to have it that way. But you need a dummy to read Member.
J
Jason Baker

Just about every developer out there will know what "" means. I personally encountered String.Empty the first time and had to spend some time searching google to figure out if they really are the exact same thing.


It is a public readonly string field and its values is ""... why would that change?
You miss the point that @Jason makes. How do you know what it is the first time you see string.Empty? Did you know what "" was the first time you saw it?
Q
Quibblesome

Any of the above.

There are many, many better things to pontificate. Such as what colour bark suits a tree best, I think vague brown with tinges of dulcet moss.


佚名

I strongly prefer String.Empty, aside from the other reasons to ensure you know what it is and that you have not accidentally removed the contents, but primarily for internationalization. If I see a string in quotes then I always have to wonder whether that is new code and it should be put into a string table. So every time code gets changed/reviewed you need to look for "something in quotes" and yes you can filter out the empty strings but I tell people it is good practice to never put strings in quotes unless you know it won't get localized.


D
Dimitry

No one mentioned that in VisualStudio String is color coded differently then string. Which is important for readability. Also, lower case is usually used for vars and type, not a big deal but String.Empty is a constant and not a var or type.


String.Empty is not a constant: see stackoverflow.com/questions/507923/… It's actually an instance of the String class, by design. And coloring was mentioned before though a bit hidden: stackoverflow.com/questions/263191/…
z
zendar

string is synonym for System.String type, They are identical.

Values are also identical: string.Empty == String.Empty == ""

I would not use character constant "" in code, rather string.Empty or String.Empty - easier to see what programmer meant.

Between string and String I like lower case string more just because I used to work with Delphi for lot of years and Delphi style is lowercase string.

So, if I was your boss, you would be writing string.Empty


R
RBT

I would favor string.Empty over String.Empty because you can use it without needing to include a using System; in your file.

As for the picking "" over string.Empty, it is personal preference and should be decided by your team.


I'm the only member of the team, how do I decide? throw a dice?
For those who might be wondering as to how it is possible to use string.Empty constant without importing using System namespace - Keywords in C# simply get converted to their fully qualified name which includes the namespace as well before being written as MSIL in the output *.dll or *.exe file. So effectively string.Empty gets written as System.String.Empty in the MSIL by the compiler. And as you already might be knowing that if you mention fully qualified type name then you can give a skip to importing namespaces at the top of your code file.
佚名

I doesn't make a difference. The last one is the quickest to type though :)


m
mckamey

It is totally a code-style preference, do to how .NET handles strings. However, here are my opinions :)

I always use the BCL Type names when accessing static methods, properties and fields: String.Empty or Int32.TryParse(...) or Double.Epsilon

I always use the C# keywords when declaring new instances: int i = 0; or string foo = "bar";

I rarely use undeclared string literals as I like to be able to scan the code to combine them into reusable named constants. The compiler replaces constants with the literals anyway so this is more of a way to avoid magic strings/numbers and to give a little more meaning to them with a name. Plus changing the values is easier.


C
Calanus

It doesn't matter - they are exactly the same thing. However, the main thing is that you must be consistent

p.s. I struggle with this sort of "whats the right thing" all the time.


In the modern world, "consistent" means consistent across all teams world-wide, which is one of the StackOverflow's goals. If I may suggest, let's use String.Empty.
Some languages don't have an Empty constant, and all of the languages I can think of allow for "" a zero-length string. So I vote for "" for consistency with other languages. :)
R
RhinoTX

I have personally witnessed "" resulting in (minor) problems twice. Once was due to a mistake of a junior developer new to team-based programming, and the other was a simple typo, but the fact is using string.Empty would have avoided both issues.

Yes, this is very much a judgement call, but when a language gives you multiple ways to do things, I tend to lean toward the one that has the most compiler oversight and strongest compile-time enforcement. That is not "". It's all about expressing specific intent.

If you type string.EMpty or Strng.Empty, the compiler lets you know you did it wrong. Immediately. It simply will not compile. As a developer you are citing specific intent that the compiler (or another developer) cannot in any way misinterpret, and when you do it wrong, you can't create a bug.

If you type " " when you mean "" or vice-versa, the compiler happily does what you told it to do. Another developer may or may not be able to glean your specific intent. Bug created.

Long before string.Empty was a thing I've used a standard library that defined the EMPTY_STRING constant. We still use that constant in case statements where string.Empty is not allowed.

Whenever possible, put the compiler to work for you, and eliminate the possibility of human error, no matter how small. IMO, this trumps "readability" as others have cited.

Specificity and compile time enforcement. It's what's for dinner.


p
plinth

I use the third, but of the other two the first seems less odd. string is an alias for String, but seeing them across an assignment feels off.


t
tvanfosson

Either of the first two would be acceptable to me. I would avoid the last one because it is relatively easy to introduce a bug by putting a space between the quotes. This particular bug would be difficult to find by observation. Assuming no typos, all are semantically equivalent.

[EDIT]

Also, you might want to always use either string or String for consistency, but that's just me.


I agree with this remark, but I still live dangerously when I'm lazy. In any event, I don't think I have occasion to write code that uses a string before assigning to it outside of the variable declaration. In fact, it's annoying to me that I have to initialize my strings at all, despite the risks.
R
Remotec

I was just looking at some code and this question popped into my mind which I had read some time before. This is certainly a question of readability.

Consider the following C# code...

(customer == null) ? "" : customer.Name

vs

(customer == null) ? string.empty : customer.Name

I personally find the latter less ambiguous and easier to read.

As pointed out by others the actual differences are negligible.


C
Chris Marasti-Georg

The compiler should make them all the same in the long run. Pick a standard so that your code will be easy to read, and stick with it.


u
user34577

While difference is very, VERY little, the difference still exist.

1) "" creates object while String.Empty does not. But this object will be created once and will be referenced from the string pool later if you have another "" in the code.

2) String and string are the same, but I would recommend to use String.Empty (as well as String.Format, String.Copy etc.) since dot notation indicates class, not operator, and having class starting with capital letter conforms to C# coding standards.


string.Empty is "", check the source
5
5argon

I use "" because it will be colored distinctively yellow in my code... for some reason String.Empty is all white in my Visual Studio Code theme. And I believe that matters to me the most.


M
Magus

I think the second is "proper," but to be honest I don't think it will matter. The compiler should be smart enough to compile any of those to the exact same bytecode. I use "" myself.


O
Otiel

On http://blogs.msdn.com/b/brada/archive/2003/04/22/49997.aspx :

As David implies, there difference between String.Empty and "" are pretty small, but there is a difference. "" actually creates an object, it will likely be pulled out of the string intern pool, but still... while String.Empty creates no object... so if you are really looking for ultimately in memory efficiency, I suggest String.Empty. However, you should keep in mind the difference is so trival you will like never see it in your code... As for System.String.Empty or string.Empty or String.Empty... my care level is low ;-)


That MSDN blog post was in 2003.... are you sure that this is still true for recent .NET versions?!
@CarstenSchütte: Seems to me that such a feature is not intended to change very much ... and if it was, there was some buzz on the Internet about it.
@sergiol If a field is more efficient than a literal then this is unambiguously a performance bug. So there’s hoping that it would be fixed by now.
W
Wouter

The empty string is like empty set just a name that everybody uses to call "". Also in formal languages strings created from an alphabet that have zero length are called the empty string. Both set and string have a special symbol for it. Empty string: ε and empty set: ∅. If you want to talk about this zero length string you will call it the empty string so everybody knows exactly what you are referring to. Now in case you name it the empty string why not use string.Empty in code, its shows the intention is explicit. Downside is that it’s not a constant and therefore not available everywhere, like in attributes. (It's not a constant for some technical reasons, see the reference source.)


K
Keith Howard

Possibly a controversial comment, but, generally, I find that my life is easier when I act consistently with Microsoft. We can't possibly know the full deeply embedded reasons (sometimes highly rigorous, and sometime kludgy, I imagine) for why they do things.

They use "" in automatically generated files like the Assembly file, so that is what I do. In fact, when I try to replace any below "" with String.Empty, Visual Studio crashes on me. There is probably a logical explanation for this, but with my limited knowledge, if I just do what they do, most of the time, things work out. (Contra: I am aware they some automatically generated files also use String.Empty, which kind of shatters my point. :) )

<Assembly: System.Reflection.AssemblyCulture("")>
<Assembly: System.Reflection.AssemblyDescription("")>
<Assembly: System.Reflection.AssemblyFileVersion("1.0.0.0")>
<Assembly: System.Reflection.AssemblyKeyFile("")>
<Assembly: System.Reflection.AssemblyProduct("")>
<Assembly: System.Reflection.AssemblyTitle("")>

Expand your knowledge by reading the other answers and by taking your time to actually understand them. :-) Maintaining consistency with the rest of your codebase and with other codebases (.NET itself, generated code...) is a good rule of thumb, but it is not always clearly applicable and sometimes even harmful -- e.g. you need to do something very specific or you are the first one to spot a deficiency in current practices on the team. Moreover, consistency with something you don't understand may lead to cargo cult programming.