
<head>
  <title>A Course in Probabilistic and Statistical Modeling in
  Computer Science</title>
  <meta http-equiv="Content-Type" content="text/html">
</head>

<img src=http://heather.cs.ucdavis.edu/~matloff/132/PLN/Cover.gif>

<h1>Open Textbook:</h1>

<h1>From Algorithms to Z-Scores:  Probabilistic
and Statistical Modeling in Computer Science</h1>

<h2><a href="mailto:matloff@cs.ucdavis.edu"> Professor Norm Matloff
</a>, University of California, Davis</h2>

<p>
<strong>OVERVIEW:</strong>
</p>

<p> The materials here form a textbook for a course in mathematical
probability and statistics for computer science students.  (It would
work fine for general students too.)  </p>

<p>
<strong>
"Why is this text different from all other texts?"
</strong>
</p>

<UL>

<li> Computer science examples are used throughout, in areas such as:
computer networks; data and text mining; computer security; remote
sensing; computer performance evaluation; software engineering; data
management; etc.  </li> </p> 

<li> The R statistical/data manipulation language is used throughout.
Since this is a computer science audience, a greater sophistication in
programming can be assumed.  It is recommended that my R tutorials
be used as a supplement: </p> 

   <UL>

   <li>  <a href="http://heather.cs.ucdavis.edu/~matloff/R/NMRIntro.pdf">
   Chapter 1 of my book on R software development</a>, <i>The Art of R
   Programming</i>, NSP, 2011 </li> </p> 

   <li> <a href="http://heather.cs.ucdavis.edu/~matloff/132/NSPpart.pdf">
   Part of a VERY rough and partial draft of that book.</a> It is 
   only about 50% complete, has various errors, and presents a number 
   of topics differently from the final version, but should be useful 
   in R work for this class.  </li> </p> 
   </UL> 

<li> Throughout the units, mathematical theory and applications are
interwoven, with a strong emphasis on <u>modeling</u>:  What do
probabilistic models really mean, in real-life terms?  How does one
choose a model?  How do we assess the practical usefulness of models?
</p>

<p> For instance, the chapter on continuous random variables begins by
explaining that such distributions do not actually exist in the real
world, due to the discreteness of our measuring instruments.  The
continuous model is therefore just that--a <i>model</i>, and indeed a 
very useful model. 
</p>

<p>
There is actually an entire chapter on modeling, discussing the tradeoff
between accuracy and simplicity of models.
</li> </p>  

<li> There is considerable discussion of the intuition involving
probabilistic concepts, and the concepts themselves are defined through
intuition.  However, all models and so on are described
precisely in terms of random variables and distributions. </li> </p> 

</UL>

<p>
For topical coverage, see the book's detailed table of contents.
</p>

<p>
<strong>The materials are continuously evolving</strong>, with new
examples and topics being added.
</p>

<p>
<i>Prerequisites</i>:  The student must know calculus, basic matrix
algebra, and have some minimal skill in programming. 
</p>


<p>
<strong>LICENSING:</strong>
</p>

<p>
<a rel="license" href="http://creativecommons.org/licenses/by-nd/3.0/us/"><img
alt="Creative Commons License" style="border-width:0"
src="http://i.creativecommons.org/l/by-nd/3.0/us/88x31.png"/></a><br/>This
work is licensed under a <a rel="license"
href="http://creativecommons.org/licenses/by-nd/3.0/us/">Creative
Commons Attribution-No Derivative Works 3.0 United States License.
</a>
Copyright is retained by N. Matloff in all non-U.S. jurisdictions,
but permission to use these materials in teaching is still granted, 
provided the authorship and licensing information here is displayed.  
I would appreciate being notified if you use this book for teaching, 
just so that I know the materials are being put to use, but this is 
not required.  
</p>

<strong>DOWNLOAD THE BOOK:</strong>

<p>
The book is freely available, subject to the conditions above, and can
be downloaded from
<a
href="http://heather.cs.ucdavis.edu/~matloff/132/PLN/probstatbook/ProbStatBook.pdf">
http://heather.cs.ucdavis.edu/~matloff/132/PLN/probstatbook/ProbStatBook.pdf</a>.
</li> </p> 

<p>
<strong>AUTHOR'S BIO:</strong>  
</p>

<p>
<a href="http://heather.cs.ucdavis.edu/matloff.html"> Norm Matloff</a> 
is a Professor of Computer Science at the University of 
California, Davis.  He was formerly a statistics professor at that 
university, and thus approaches the subject matter here as both a 
statistician and computer scientist.  His research has included a 
number of diverse areas in the two fields, and he has been a 
recipient of the university's Distinguished Teaching Award.  He was born
and raised in Los Angeles, and earned his PhD in theoretical 
mathematics (probability theory/functional analysis) at UCLA.
</p>
