Arun's Blogspot

Monday, July 30, 2007

Learn SQL

SQL is a standard computer language for accessing and manipulating databases.
What is SQL?
SQL stands for Structured Query Language
SQL allows you to access a database
SQL is an ANSI standard computer language
SQL can execute queries against a database
SQL can retrieve data from a database
SQL can insert new records in a database
SQL can delete records from a database
SQL can update records in a database
SQL is easy to learn
SQL is a Standard - BUT....
SQL is an ANSI (American National Standards Institute) standard computer language for accessing and manipulating database systems. SQL statements are used to retrieve and update data in a database. SQL works with database programs like MS Access, DB2, Informix, MS SQL Server, Oracle, Sybase, etc.
Unfortunately, there are many different versions of the SQL language, but to be in compliance with the ANSI standard, they must support the same major keywords in a similar manner (such as SELECT, UPDATE, DELETE, INSERT, WHERE, and others).
Note: Most of the SQL database programs also have their own proprietary extensions in addition to the SQL standard!
SQL Database Tables
A database most often contains one or more tables. Each table is identified by a name (e.g. "Customers" or "Orders"). Tables contain records (rows) with data.
Below is an example of a table called "Persons":
LastName
FirstName
Address
City
Hansen
Ola
Timoteivn 10
Sandnes
Svendson
Tove
Borgvn 23
Sandnes
Pettersen
Kari
Storgt 20
Stavanger
The table above contains three records (one for each person) and four columns (LastName, FirstName, Address, and City).
SQL Queries
With SQL, we can query a database and have a result set returned.
A query like this:
SELECT LastName FROM Persons
Gives a result set like this:
LastName
Hansen
Svendson
Pettersen
Note: Some database systems require a semicolon at the end of the SQL statement. We don't use the semicolon in our tutorials.
SQL Data Manipulation Language (DML)
SQL (Structured Query Language) is a syntax for executing queries. But the SQL language also includes a syntax to update, insert, and delete records.
These query and update commands together form the Data Manipulation Language (DML) part of SQL:
SELECT - extracts data from a database table
UPDATE - updates data in a database table
DELETE - deletes data from a database table
INSERT INTO - inserts new data into a database table
SQL Data Definition Language (DDL)
The Data Definition Language (DDL) part of SQL permits database tables to be created or deleted. We can also define indexes (keys), specify links between tables, and impose constraints between database tables.
The most important DDL statements in SQL are:
CREATE TABLE - creates a new database table
ALTER TABLE - alters (changes) a database table
DROP TABLE - deletes a database table
CREATE INDEX - creates an index (search key)
DROP INDEX - deletes an index
The SQL SELECT Statement
The SELECT statement is used to select data from a table. The tabular result is stored in a result table (called the result-set).
Syntax
SELECT column_name(s)FROM table_name
Note: SQL statements are not case sensitive. SELECT is the same as select.
SQL SELECT Example
To select the content of columns named "LastName" and "FirstName", from the database table called "Persons", use a SELECT statement like this:
SELECT LastName,FirstName FROM Persons
The database table "Persons":
LastName
FirstName
Address
City
Hansen
Ola
Timoteivn 10
Sandnes
Svendson
Tove
Borgvn 23
Sandnes
Pettersen
Kari
Storgt 20
Stavanger
The result
LastName
FirstName
Hansen
Ola
Svendson
Tove
Pettersen
Kari

Select All Columns
To select all columns from the "Persons" table, use a * symbol instead of column names, like this:
SELECT * FROM Persons
Result
LastName
FirstName
Address
City
Hansen
Ola
Timoteivn 10
Sandnes
Svendson
Tove
Borgvn 23
Sandnes
Pettersen
Kari
Storgt 20
Stavanger

The Result Set
The result from a SQL query is stored in a result-set. Most database software systems allow navigation of the result set with programming functions, like: Move-To-First-Record, Get-Record-Content, Move-To-Next-Record, etc.
Programming functions like these are not a part of this tutorial. To learn about accessing data with function calls, please visit our ADO tutorial.
Semicolon after SQL Statements?
Semicolon is the standard way to separate each SQL statement in database systems that allow more than one SQL statement to be executed in the same call to the server.
Some SQL tutorials end each SQL statement with a semicolon. Is this necessary? We are using MS Access and SQL Server 2000 and we do not have to put a semicolon after each SQL statement, but some database programs force you to use it.
The SELECT DISTINCT Statement
The DISTINCT keyword is used to return only distinct (different) values.
The SELECT statement returns information from table columns. But what if we only want to select distinct elements?
With SQL, all we need to do is to add a DISTINCT keyword to the SELECT statement:
Syntax
SELECT DISTINCT column_name(s)FROM table_name

Using the DISTINCT keyword
To select ALL values from the column named "Company" we use a SELECT statement like this:
SELECT Company FROM Orders
"Orders" table
Company
OrderNumber
Sega
3412
W3Schools
2312
Trio
4678
W3Schools
6798
Result
Company
Sega
W3Schools
Trio
W3Schools
Note that "W3Schools" is listed twice in the result-set.
To select only DIFFERENT values from the column named "Company" we use a SELECT DISTINCT statement like this:
SELECT DISTINCT Company FROM Orders
Result:
Company
Sega
W3Schools
Trio
Now "W3Schools" is listed only once in the result-set.
The WHERE clause is used to specify a selection criterion.
The WHERE Clause
To conditionally select data from a table, a WHERE clause can be added to the SELECT statement.
Syntax
SELECT column FROM tableWHERE column operator value
With the WHERE clause, the following operators can be used:
Operator
Description
=
Equal
<>
Not equal
>
Greater than
<
Less than
>=
Greater than or equal
<=
Less than or equal
BETWEEN
Between an inclusive range
LIKE
Search for a pattern
IN
If you know the exact value you want to return for at least one of the columns
Note: In some versions of SQL the <> operator may be written as !=
Using the WHERE Clause
To select only the persons living in the city "Sandnes", we add a WHERE clause to the SELECT statement:
SELECT * FROM PersonsWHERE City='Sandnes'
"Persons" table
LastName
FirstName
Address
City
Year
Hansen
Ola
Timoteivn 10
Sandnes
1951
Svendson
Tove
Borgvn 23
Sandnes
1978
Svendson
Stale
Kaivn 18
Sandnes
1980
Pettersen
Kari
Storgt 20
Stavanger
1960
Result
LastName
FirstName
Address
City
Year
Hansen
Ola
Timoteivn 10
Sandnes
1951
Svendson
Tove
Borgvn 23
Sandnes
1978
Svendson
Stale
Kaivn 18
Sandnes
1980

Using Quotes
Note that we have used single quotes around the conditional values in the examples.
SQL uses single quotes around text values (most database systems will also accept double quotes). Numeric values should not be enclosed in quotes.
For text values:
This is correct:SELECT * FROM Persons WHERE FirstName='Tove'This is wrong:SELECT * FROM Persons WHERE FirstName=Tove
For numeric values:
This is correct:SELECT * FROM Persons WHERE Year>1965This is wrong:SELECT * FROM Persons WHERE Year>'1965'

The LIKE Condition
The LIKE condition is used to specify a search for a pattern in a column.
Syntax
SELECT column FROM tableWHERE column LIKE pattern
A "%" sign can be used to define wildcards (missing letters in the pattern) both before and after the pattern.
Using LIKE
The following SQL statement will return persons with first names that start with an 'O':
SELECT * FROM PersonsWHERE FirstName LIKE 'O%'
The following SQL statement will return persons with first names that end with an 'a':
SELECT * FROM PersonsWHERE FirstName LIKE '%a'
The following SQL statement will return persons with first names that contain the pattern 'la':
SELECT * FROM PersonsWHERE FirstName LIKE '%la%'
The INSERT INTO Statement
The INSERT INTO statement is used to insert new rows into a table.
Syntax
INSERT INTO table_nameVALUES (value1, value2,....)
You can also specify the columns for which you want to insert data:
INSERT INTO table_name (column1, column2,...)VALUES (value1, value2,....)

Insert a New Row
This "Persons" table:
LastName
FirstName
Address
City
Pettersen
Kari
Storgt 20
Stavanger
And this SQL statement:
INSERT INTO Persons VALUES ('Hetland', 'Camilla', 'Hagabakka 24', 'Sandnes')
Will give this result:
LastName
FirstName
Address
City
Pettersen
Kari
Storgt 20
Stavanger
Hetland
Camilla
Hagabakka 24
Sandnes

Insert Data in Specified Columns
This "Persons" table:
LastName
FirstName
Address
City
Pettersen
Kari
Storgt 20
Stavanger
Hetland
Camilla
Hagabakka 24
Sandnes
And This SQL statement:
INSERT INTO Persons (LastName, Address)VALUES ('Rasmussen', 'Storgt 67')
Will give this result:
LastName
FirstName
Address
City
Pettersen
Kari
Storgt 20
Stavanger
Hetland
Camilla
Hagabakka 24
Sandnes
Rasmussen

Storgt 67

The Update Statement
The UPDATE statement is used to modify the data in a table.
Syntax
UPDATE table_nameSET column_name = new_valueWHERE column_name = some_value

Person:
LastName
FirstName
Address
City
Nilsen
Fred
Kirkegt 56
Stavanger
Rasmussen

Storgt 67

Update one Column in a Row
We want to add a first name to the person with a last name of "Rasmussen":
UPDATE Person SET FirstName = 'Nina'WHERE LastName = 'Rasmussen'
Result:
LastName
FirstName
Address
City
Nilsen
Fred
Kirkegt 56
Stavanger
Rasmussen
Nina
Storgt 67

Update several Columns in a Row
We want to change the address and add the name of the city:
UPDATE PersonSET Address = 'Stien 12', City = 'Stavanger'WHERE LastName = 'Rasmussen'
Result:
LastName
FirstName
Address
City
Nilsen
Fred
Kirkegt 56
Stavanger
Rasmussen
Nina
Stien 12
Stavanger
The DELETE Statement
The DELETE statement is used to delete rows in a table.
Syntax
DELETE FROM table_nameWHERE column_name = some_value

Person:
LastName
FirstName
Address
City
Nilsen
Fred
Kirkegt 56
Stavanger
Rasmussen
Nina
Stien 12
Stavanger

Delete a Row
"Nina Rasmussen" is going to be deleted:
DELETE FROM Person WHERE LastName = 'Rasmussen'
Result
LastName
FirstName
Address
City
Nilsen
Fred
Kirkegt 56
Stavanger

Delete All Rows
It is possible to delete all rows in a table without deleting the table. This means that the table structure, attributes, and indexes will be intact:
DELETE FROM table_nameorDELETE * FROM table_name
Test your SQL Skills
On this page you can test your SQL skills.
We will use the Customers table in the Northwind database:
CompanyName
ContactName
Address
City
Alfreds Futterkiste
Maria Anders
Obere Str. 57
Berlin
Berglunds snabbköp
Christina Berglund
Berguvsvägen 8
Luleå
Centro comercial Moctezuma
Francisco Chang
Sierras de Granada 9993
México D.F.
Ernst Handel
Roland Mendel
Kirchgasse 6
Graz
FISSA Fabrica Inter. Salchichas S.A.
Diego Roel
C/ Moralzarzal, 86
Madrid
Galería del gastrónomo
Eduardo Saavedra
Rambla de Cataluña, 23
Barcelona
Island Trading
Helen Bennett
Garden House Crowther Way
Cowes
Königlich Essen
Philip Cramer
Maubelstr. 90
Brandenburg
Laughing Bacchus Wine Cellars
Yoshi Tannamuri
1900 Oak St.
Vancouver
Magazzini Alimentari Riuniti
Giovanni Rovelli
Via Ludovico il Moro 22
Bergamo
North/South
Simon Crowther
South House 300 Queensbridge
London
Paris spécialités
Marie Bertrand
265, boulevard Charonne
Paris
Rattlesnake Canyon Grocery
Paula Wilson
2817 Milton Dr.
Albuquerque
Simons bistro
Jytte Petersen
Vinbæltet 34
København
The Big Cheese
Liz Nixon
89 Jefferson Way Suite 2
Portland
Vaffeljernet
Palle Ibsen
Smagsløget 45
Århus
Wolski Zajazd
Zbyszek Piestrzeniewicz
ul. Filtrowa 68
Warszawa
To preserve space, the table above is a subset of the Customers table used in the example below.
Try it Yourself
To see how SQL works, you can copy the SQL statements below and paste them into the textarea, or you can make your own SQL statements.
SELECT * FROM customers

SELECT CompanyName, ContactName
FROM customers

SELECT * FROM customers
WHERE companyname LIKE 'a%'

SELECT CompanyName, ContactName
FROM customers
WHERE CompanyName > 'g'
AND ContactName > 'g'
Top of Form
SELECT * FROM customers
Bottom of Form

Tuesday, July 24, 2007

Solaris Useful Commands

Commands

TRUSS

truss -c (Solaris >= 8): This astounding option to truss provides a profile summary of the command being trussed

$ truss -c grep asdf work.doc

syscall              seconds   calls  errors

_exit                    .00       1

read                     .01      24

open                     .00       8      4

close                    .00       5

brk                      .00      15

stat                     .00       1

fstat                    .00       4

execve                   .00       1

mmap                     .00      10

munmap                   .01       3

memcntl                  .00       2

llseek                   .00       1

open64                   .00       1

                        ----     ---    ---

sys totals:              .02      76      4

usr time:                .00

elapsed:                 .05

It can also show profile data on a running process. In this case, the data shows what the process did between when truss was started and when truss execution was terminated with a control-c. It’s ideal for determining why a process is hung without having to wade through the pages of truss output.

truss -d and truss -D (Solaris >= 8): These truss options show the time associated with each system call being shown by truss and is excellent for finding performance problems in custom or commercial code. For example:

$ truss -d who

Base time stamp:  1035385727.3460  [ Wed Oct 23 11:08:47 EDT 2002 ]

 0.0000 execve(“/usr/bin/who”, 0xFFBEFD5C, 0xFFBEFD64)  argc = 1

 0.0032 stat(“/usr/bin/who”, 0xFFBEFA98)                = 0

 0.0037 open(“/var/ld/ld.config”, O_RDONLY)             Err#2 ENOENT

 0.0042 open(“/usr/local/lib/libc.so.1”, O_RDONLY)      Err#2 ENOENT

 0.0047 open(“/usr/lib/libc.so.1”, O_RDONLY)            = 3

 0.0051 fstat(3, 0xFFBEF42C)                            = 0

. . .

truss -D is even more useful, showing the time delta between system calls:

Dilbert> truss -D who

 0.0000 execve(“/usr/bin/who”, 0xFFBEFD5C, 0xFFBEFD64)  argc = 1

 0.0028 stat(“/usr/bin/who”, 0xFFBEFA98)                = 0

 0.0005 open(“/var/ld/ld.config”, O_RDONLY)             Err#2 ENOENT

 0.0006 open(“/usr/local/lib/libc.so.1”, O_RDONLY)      Err#2 ENOENT

 0.0005 open(“/usr/lib/libc.so.1”, O_RDONLY)            = 3

 0.0004 fstat(3, 0xFFBEF42C)                            = 0

In this example, the stat system call took a lot longer than the others.

truss -T: This is a great debugging help. It will stop a process at the execution of a specified system call. (“-U” does the same, but with user-level function calls.) A core could then be taken for further analysis, or any of the /proc tools could be used to determine many aspects of the status of the process.

truss -l (improved in Solaris 9): Shows the thread number of each call in a multi-threaded process. Solaris 9 truss -l finally makes it possible to watch the execution of a multi-threaded application.

Truss is truly a powerful tool. It can be used on core files to analyze what caused the problem, for example. It can also show details on user-level library calls (either system libraries or programmer libraries) via the “-u” option.

PKG-GET

pkg-get: This is a nice tool (http://www.bolthole.com/solaris) for automatically getting freeware packages. It is configured via /etc/pkg-get.conf. Once it’s up and running, execute pkg-get -a to get a list of available packages, and pkg-get -i to get and install a given package.

COREADM

coreadm (Solaris >= 8): In the “old” days (before coreadm), core dumps were placed in the process’s working directory. Core files would also overwrite each other. All this and more has been addressed by coreadm, a tool to manage core file creation. With it, you can specify whether to save cores, where cores should be stored, how many versions should be retained, and more. Settings can be retained between reboots by coreadm modifying /etc/coreadm.conf.

PROCESS MANAGEMENT

pgrep (Solaris >= 8): pgrep searches through /proc for processes matching the given criteria, and returns their process-ids. A great option is “-n”, which returns the newest process that matches.

preap (Solaris >= 9): Reaps zombie processes. Any processes stuck in the “z” state (as shown by ps), can be removed from the system with this command.

pargs (Solaris >= 9): Shows the arguments and environment variables of a process.

nohup -p (Solaris >= 9): The nohup command can be used to start a process, so that if the shell that started the process closes (i.e., the process gets a “SIGHUP” signal), the process will keep running. This is useful for backgrounding a task that should continue running no matter what happens around it. But what happens if you start a process and later want to HUP-proof it? With Solaris 9, nohup -p takes a process-id and causes SIGHUP to be ignored.

plimit (Solaris >= 8): This command displays and sets the per-process limits on a running process. This is handy if a long-running process is running up against a limit (for example, number of open files). Rather than using limit and restarting the command, plimit can modify the running process.

prstat (Solaris >= 8): prstat is top and a lot more. Both commands provide a screen’s worth of process and other information and update it frequently, for a nice window on system performance. prstat has much better accuracy than top. It also has some nice options. “-a” shows process and user information concurrently (sorted by CPU hog, by default). “-c” causes it to act like vmstat (new reports printed below old ones). “-C” shows processes in a processor set. “-j” shows processes in a “project”. “-L” shows per-thread information as well as per-process. “-m” and “-v” show quite a bit of per-process performance detail (including pages, traps, lock wait, and CPU wait). The output data can also be sorted by resident-set (real memory) size, virtual memory size, execute time, and so on. prstat is very useful on systems without top, and should probably be used instead of top because of its accuracy (and some sites care that it is a supported program).

trapstat (Solaris >= 9): trapstat joins lockstat and kstat as the most inscrutable commands on Solaris. Each shows gory details about the innards of the running operating system. Each is indispensable in solving strange happenings on a Solaris system. Best of all, their output is good to send along with bug reports, but further study can reveal useful information for general use as well.

vmstat -p (Solaris >= 8): Until this option became available, it was almost impossible (see the “se toolkit”) to determine what kind of memory demand was causing a system to page. vmstat -p is key because it not only shows whether your system is under memory stress (via the “sr” column), it also shows whether that stress is from application code, application data, or I/O. “-p” can really help pinpoint the cause of any mysterious memory issues on Solaris.

pmap -x (Solaris >= 8, bugs fixed in Solaris >= 9): If the process with memory problems is known, and more details on its memory use are needed, check out pmap -x. The target process-id has its memory map fully explained, as in:

# pmap -x 1779

1779:   -ksh

 Address  Kbytes     RSS    Anon  Locked Mode   Mapped File

00010000     192     192       -       - r-x--  ksh

00040000       8       8       8       - rwx--  ksh

00042000      32      32       8       - rwx--    [ heap ]

FF180000     680     664       -       - r-x--  libc.so.1

FF23A000      24      24       -       - rwx--  libc.so.1

FF240000       8       8       -       - rwx--  libc.so.1

FF280000     568     472       -       - r-x--  libnsl.so.1

FF31E000      32      32       -       - rwx--  libnsl.so.1

FF326000      32      24       -       - rwx--  libnsl.so.1

FF340000      16      16       -       - r-x--  libc_psr.so.1

FF350000      16      16       -       - r-x--  libmp.so.2

FF364000       8       8       -       - rwx--  libmp.so.2

FF380000      40      40       -       - r-x--  libsocket.so.1

FF39A000       8       8       -       - rwx--  libsocket.so.1

FF3A0000       8       8       -       - r-x--  libdl.so.1

FF3B0000       8       8       8       - rwx--    [ anon ]

FF3C0000     152     152       -       - r-x--  ld.so.1

FF3F6000       8       8       8       - rwx--  ld.so.1

FFBFE000       8       8       8       - rw---    [ stack ]

-------- ------- ------- ------- -------

total Kb    1848    1728      40       -

Here we see each chunk of memory, what it is being used for, how much space it is taking (virtual and real), and mode information.

DISK SPACE

df -h (Solaris >= 9): This command is popular on Linux, and just made its way into Solaris. df -h displays summary information about file systems in human-readable form:

$ df -h

Filesystem             size   used  avail capacity  Mounted on

/dev/dsk/c0t0d0s0      4.8G   1.7G   3.0G    37%    /

/proc                    0K     0K     0K     0%    /proc

mnttab                   0K     0K     0K     0%    /etc/mnttab

fd                       0K     0K     0K     0%    /dev/fd

swap                   848M    40K   848M     1%    /var/run

swap                   849M   1.0M   848M     1%    /tmp

/dev/dsk/c0t0d0s7       13G    78K    13G     1%    /export/home