Sponsored By

Opinion: A Tale of Two Call StacksOpinion: A Tale of Two Call Stacks

In this reprinted <a href="http://altdevblogaday.com/">#altdevblogaday</a> opinion piece, Valve Software programmer Bruce Dawson explains why graphics vendors should share their code symbols with the public, and puts out a call to action for developers.

Bruce Dawson, Blogger

November 24, 2011

5 Min Read
Game Developer logo in a gray background | Game Developer

[In this reprinted #altdevblogaday opinion piece, Valve Software programmer Bruce Dawson explains why graphics vendors should share their code symbols with the public, and puts out a call to action for developers.] My Kingdom For Some Symbols I spend a large portion of my time at work trying to make things faster and less crashy. Usually the problems I investigate are in our own code, so I have full information – source code and symbols. However, sometimes the problems are at least partially in some other company's code, and the task gets trickier. Mandatory disclaimer: this post represents my opinion, not that of my employer. A Call Stack We Can Believe In For example, last week I investigated a Visual Studio hang. This intermittent hang had been bothering me for months and I finally decided to record an xperf trace of the hang and investigate. The details will be the subject of another post, but one vital clue was this call stack from the xperf CPU scheduling summary table: The call stack is entirely in Microsoft code. It starts in Visual Studio and ends up in Windows, and this call stack shows that Visual Studio hung for 2.585 seconds while trying to CreateFileW so that it can GotoEntry in the CResultList. Even though I know nothing about the Visual Studio architecture, that was enough information to let me understand the problem, and I then changed our project files in order to completely avoid this hang in the future. Shazam! The reason I was able to diagnose this problem is because Microsoft publishes symbols for most of its code on a public symbol server. Symbols are published for Windows, Visual Studio, and much more, and this often lets me fix performance problems and crashes even when they are entirely separate from our code. Yay Microsoft! A Call Stack That Knows How To Keep Its Secrets Another example, not quite so happy, is demonstrated by this call stack. This is sampling profiler data from a thread that is in our game: Huh. This thread sure is using a lot of CPU time. In our process. I wonder what it's doing? Except for the two out 1,036 samples that hit in Windows functions I can only tell that it is NVIDIA code that is executing – there are no indications as to what it is doing. I don't mean to pick on NVIDIA here. Well, to be more accurate, I don't mean to just pick on NVIDIA. This is a problem with all three major graphics vendors – NVIDIA, AMD, and Intel. None of them share symbols with the public and this leaves game developers with a significant problem. When a crash occurs deep in graphics driver code (a not uncommon occurrence), we are helpless. When a frame glitch occurs deep in graphics driver code (also quite common), we are helpless. And when game startup includes excessive memory allocations or CPU time deep in graphics driver code… we are helpless. You Can't Handle The Truth I've been told by some graphics vendors that having symbols would not be valuable to game developers, and might even be confusing. Game developers couldn't possibly understand their cryptic function names, and might misinterpret them. Poppycock. I've solved dozens of performance problems in other people's code, with just symbols to guide me. Having symbols has never been confusing, and has almost always saved me time. If I had symbols for the graphics drivers, then I could solve some problems on my own. I could recognize patterns in the crashes and performance problems that I see. I could give more precise suggestions and bug reports to the graphics vendors. I could more easily figure out what it happening in my code that is causing problems in their code. As it is, I can do almost nothing. Significant CPU time and memory is being consumed in my game's process and I don't have symbols to help understand why. Call To Action If you're a game developer, ask the graphics vendors that you work with for symbols. They'll say no, but it's still important to ask, in order to remind them of the importance of this issue. After they say no, be sure to send them all of the crash dumps and xperf traces where they are a factor and insist that they help you, since they won't let us help ourselves. And, don't forget to share your stories and needs for symbols in the comments. If you're a graphics vendor – please release symbols. I'm confident that if you do it will let us make better games on your hardware, while saving time for your support team, and for game developers. I know that deciding to share symbols is hard, because symbols reveal a lot. I get that. But not sharing symbols with anyone is counterproductive.

Final Disclaimer

I own stock in Microsoft, Intel, and AMD, but not NVIDIA. I hope it has not affected my impartiality. You decide. [This piece was reprinted from #AltDevBlogADay, a shared blog initiative started by @mike_acton devoted to giving game developers of all disciplines a place to motivate each other to write regularly about their personal game development passions.]

About the Author

Bruce Dawson

Blogger

Bruce is the director of technology at Humongous Entertainment, which means he gets to work on all the fun and challenging tasks that nobody else has time to do. He also teaches part-time at DigiPen. Prior to Humongous Entertainment he worked at Cavedog Entertainment, assisting various product teams. Bruce worked for several years at the Elastic Reality branch of Avid Technology, writing special effects software and video editing plug-ins, and before that worked at Electronic Arts Canada, back when it was called Distinctive Software. There he wrote his first computer games, for the Commodore Amiga, back when thirty-two colours was state of the art. Bruce presented a paper at GDC 2001 called "What Happened to my Colours?!?" about the quirks of NTSC. He is currently trying to perfect the ultimate Python script that will automate all of his job duties - so that he can spend more time playing with the console machine he won at a poker game. Bruce lives with his wonderful wife and two exceptional children near Seattle, Washington where he tries to convince his coworkers that all computer programmers should juggle, unicycle and balance on a tight wire. You can contact Bruce Dawson at: [email protected]

Daily news, dev blogs, and stories from Game Developer straight to your inbox

You May Also Like