Tuesday, February 5, 2008


Computer storage, computer memory, and often casually memory refer to computer components, devices and recording media that retain digital data used for computing for some interval of time. Computer storage provides one of the core functions of the modern computer, that of information retention. It is one of the fundamental components of all modern computers, and coupled with a central processing unit (CPU), implements the basic Von Neumann computer model used since the 1940s.
In contemporary usage, memory usually refers to a form of solid state storage known as random access memory (RAM) and sometimes other forms of fast but temporary storage. Similarly, storage more commonly refers to mass storage - optical discs, forms of magnetic storage like hard disks, and other types of storage which are slower than RAM, but of a more permanent nature. These contemporary distinctions are helpful, because they are also fundamental to the architecture of computers in general. As well, they reflect an important and significant technical difference between memory and mass storage devices, which has been blurred by the historical usage of the terms "main storage" (and sometimes "primary storage") for random access memory, and "secondary storage" for mass storage devices. This is explained in the following sections, in which the traditional "storage" terms are used as sub-headings for convenience.
The fundamental components of a general-purpose computer are arithmetic and logic unit, control circuitry, storage space, and input/output devices. If storage is removed, a computer can only function as a digital signal processing device (e.g. calculator, media player). The ability to store instructions that form a computer program, and the information that the instructions manipulate, is what makes stored program architecture computers versatile enough for general purposes.
A digital computer represents information using the binary numeral system. Text, numbers, pictures, audio, and nearly any other form of information can be converted into a string of bits, or binary digits, each of which has a value of 1 or 0. The most common unit of storage is the byte, equal to 8 bits. A piece of information can be manipulated by any computer whose storage space is large enough to accommodate the corresponding data, or the binary representation of the piece of information. For example, a computer with a storage space of eight million bits, or one megabyte, could be used to edit a small novel.
Various forms of storage, based on various natural phenomena, have been invented. So far, no practical universal storage medium exists, and all forms of storage have some drawbacks. Therefore a computer system usually contains several kinds of storage, each with an individual purpose, as shown in the diagram.

Primary storage
Secondary storage requires the computer to use its input/output channels to access the information, and is used for long-term storage of persistent information. However most computer operating systems also use secondary storage devices as virtual memory - to artificially increase the apparent amount of main memory in the computer. Secondary storage is also known as "mass storage", as shown in the diagram above. Secondary or mass storage is typically of much greater capacity than primary storage (main memory), but it is also much slower. In modern computers, hard disks are usually used for mass storage. The time taken to access a given byte of information stored on a hard disk is typically a few thousandths of a second, or milliseconds. By contrast, the time taken to access a given byte of information stored in random access memory is measured in thousand-millionths of a second, or nanoseconds. This illustrates the very significant speed difference which distinguishes solid-state memory from rotating magnetic storage devices: hard disks are typically about a million times slower than memory. Rotating optical storage devices, such as CD and DVD drives, are typically even slower than hard disks, although their access speeds are likely to improve with advances in technology. Therefore, the use of virtual memory, which is millions of times slower than "real" memory, significantly degrades the performance of any computer. Virtual memory is implemented by many operating systems using terms like swap file or "cache file". The main historical advantage of virtual memory was that it was much less expensive than real memory. That advantage is less relevant today, yet surprisingly most operating systems continue to implement it, despite the significant performance penalties.
Off-line storage is a system where the storage medium can be easily removed from the storage device. Off-line storage is used for data transfer and archival purposes. In modern computers, CDs, DVDs, memory cards, flash memory devices including "USB drives", floppy disks, Zip disks and magnetic tapes are commonly used for off-line mass storage purposes. "Hot-pluggable" USB hard disks are also available. Off-line storage devices used in the past include punched cards, microforms, and removable Winchester disk drums.

Secondary and off-line storage
Tertiary storage is a system where a robotic arm will "mount" (connect) or "dismount" off-line mass storage media (see the previous item) according to the computer operating system's demands. Tertiary storage is used in the realms of enterprise storage and scientific computing on large computer systems and business computer networks, and is something a typical personal computer user never sees firsthand.
Database storage is a system where information in computers is stored in large databases, data banks, data warehouses, or data vaults. It involves packing and storing large amounts of storage devices throughout a series of shelves in a room, usually an office, all linked together. The information in database storage systems can be accessed by a supercomputer, mainframe computer, or personal computer. Databases, data banks, and data warehouses, etc, can only be accessed by authorized users.

Tertiary and database storage
Network storage is any type of computer storage that involves accessing information over a computer network. Network storage arguably allows to centralize the information management in an organization, and to reduce the duplication of information. Network storage includes:
Confusingly, these terms are sometimes used differently. Primary storage can be used to refer to local random-access disk storage, which should properly be called secondary storage. If this type of storage is called primary storage, then the term secondary storage would refer to offline, sequential-access storage like tape media.

Network-attached storage is secondary or tertiary storage attached to a computer which another computer can access at file level over a local-area network, a private wide-area network, or in the case of online file storage, over the Internet.
Storage area network provides other computers with storage capacity over a network, the crucial difference between network-attached storage (NAS) and storage area Networks (SAN) is the former presents and manages file systems to client computers, whilst a SAN provides access to disks at block addressing level, leaving it to attaching systems to manage data or file systems within the provided capacity.
Network computers are computers that do not contain internal secondary storage devices. Instead, documents and other data are stored on a network-attached storage. Network storage
The division to primary, secondary, tertiary and off-line storage is based on memory hierarchy, or distance from the central processing unit. There are also other ways to characterize various types of storage.

Characteristics of storage

Volatile memory requires constant power to maintain the stored information. Volatile memory is typically used only for primary storage. (Primary storage is not necessarily volatile, even though today's most cost-effective primary storage technologies are. Non-volatile technologies have been widely used for primary storage in the past and may again be in the future.)
Non-volatile memory will retain the stored information even if it is not constantly supplied with electric power. It is suitable for long-term storage of information, and therefore used for secondary, tertiary, and off-line storage.
Dynamic memory is volatile memory which also requires that stored information is periodically refreshed, or read and rewritten without modifications. Computer memory Volatility of information

Random access means that any location in storage can be accessed at any moment in the same, usually small, amount of time. This makes random access memory well suited for primary storage.
Sequential access means that the accessing a piece of information will take a varying amount of time, depending on which piece of information was accessed last. The device may need to seek (e.g. to position the read/write head correctly), or cycle (e.g. to wait for the correct location in a revolving medium to appear below the read/write head). Ability to access non-contiguous information

Read/write storage, or mutable storage, allows information to be overwritten at any time. A computer without some amount of read/write storage for primary storage purposes would be useless for many tasks. Modern computers typically use read/write storage also for secondary storage.
Read only storage retains the information stored at the time of manufacture, and write once storage (WORM) allows the information to be written only once at some point after manufacture. These are called immutable storage. Immutable storage is used for tertiary and off-line storage. Examples include CD-R.
Slow write, fast read storage is read/write storage which allows information to be overwritten multiple times, but with the write operation being much slower than the read operation. Examples include CD-RW. Ability to change information

In location-addressable storage, each individually accessible unit of information in storage is selected with its numerical memory address. In modern computers, location-addressable storage usually limits to primary storage, accessed internally by computer programs, since location-addressability is very efficient, but burdensome for humans.
In file system storage, information is divided into files of variable length, and a particular file is selected with human-readable directory and file names. The underlying device is still location-addressable, but the operating system of a computer provides the file system abstraction to make the operation more understandable. In modern computers, secondary, tertiary and off-line storage use file systems.
In content-addressable storage, each individually accessible unit of information is selected with a hash value, or a short identifier with number? pertaining to the memory address the information is stored on. Content-addressable storage can be implemented using software (computer program) or hardware (computer device), with hardware being faster but more expensive option. Addressability of information

Storage capacity is the total amount of stored information that a storage device or medium can hold. It is expressed as a quantity of bits or bytes (e.g. 10.4 megabytes).
Storage density refers to the compactness of stored information. It is the storage capacity of a medium divided with a unit of length, area or volume (e.g. 1.2 megabytes per square inch).
Latency is the time it takes to access a particular location in storage. The relevant unit of measurement is typically nanosecond for primary storage, millisecond for secondary storage, and second for tertiary storage. It may make sense to separate read latency and write latency, and in case of sequential access storage, minimum, maximum and average latency.
Throughput is the rate at which information can be read from or written to the storage. In computer storage, throughput is usually expressed in terms of megabytes per second or MB/s, though bit rate may also be used. As with latency, read rate and write rate may need to be differentiated. Capacity and performance

Technologies, devices and media
Magnetic storage uses different patterns of magnetization on a magnetically coated surface to store information. Magnetic storage is non-volatile. The information is accessed using one or more read/write heads. Since the read/write head only covers a part of the surface, magnetic storage is sequential access and must seek, cycle or both. In modern computers, the magnetic surface will take these forms:
In early computers, magnetic storage was also used for primary storage in a form of magnetic drum, or core memory, core rope memory, thin film memory, twistor memory or bubble memory. Also unlike today, magnetic tape was often used for secondary storage.

Magnetic disk

  • Floppy disk, used for off-line storage
    Hard disk, used for secondary storage
    Magnetic tape data storage, used for tertiary and off-line storage Magnetic storage
    Semiconductor memory uses semiconductor-based integrated circuits to store information. A semiconductor memory chip may contain millions of tiny transistors or capacitors. Both volatile and non-volatile forms of semiconductor memory exist. In modern computers, primary storage almost exclusively consists of dynamic volatile semiconductor memory or dynamic random access memory. Since the turn of the century, a type of non-volatile semiconductor memory known as flash memory has steadily gained share as off-line storage for home computers. Non-volatile semiconductor memory is also used for secondary storage in various advanced electronic devices and specialized computers.

    Semiconductor storage
    Optical disc storage uses tiny pits etched on the surface of a circular disc to store information, and reads this information by illuminating the surface with a laser diode and observing the reflection. Optical disc storage is non-volatile and sequential access. The following forms are currently in common use:
    The following form have also been proposed:

    CD, CD-ROM, DVD: Read only storage, used for mass distribution of digital information (music, video, computer programs)
    CD-R, DVD-R, DVD+R: Write once storage, used for tertiary and off-line storage
    CD-RW, DVD-RW, DVD+RW, DVD-RAM: Slow write, fast read storage, used for tertiary and off-line storage
    Blu-ray (BD)
    HD DVD
    Ultra Density Optical (UDO)
    Professional Disc for DATA (PDD or ProDATA)
    Holographic Versatile Disc (HVD)
    Phase-change Dual
    3D optical data storage Optical disc storage
    Magneto-optical disc storage is optical disc storage where the magnetic state on a ferromagnetic surface stores information. The information is read optically and written by combining magnetic and optical methods. Magneto-optical disc storage is non-volatile, sequential access, slow write, fast read storage used for tertiary and off-line storage.

    Ultra Density Optical disc storage
    Optical jukebox storage is a robotic storage device that utilizes optical disk device and can automatically load and unload optical disks and provide terabytes of near-line information. The devices are often called optical disk libraries, robotic drives, or autochangers. Jukebox devices may have up to 1,000 slots for disks, and usually have a picking device that traverses the slots and drives. The arrangement of the slots and picking devices affects performance, depending on the space between a disk and the picking device. Seek times and transfer rates vary depending upon the optical technology. Jukeboxes are used in high-capacity archive storage environments such as imaging, medical, and video. HSM is a strategy that moves little-used or unused files from fast magnetic storage to optical jukebox devices in a process called migration. If the files are needed, they are migrated back to magnetic disk.

    Optical Jukebox storage
    Paper tape and punch cards have been used to store information for automatic processing since the 1890s, long before general-purpose computers existed. Information was recorded by punching holes into the paper or cardboard medium, and was read by electrically (or, later, optically) sensing whether a particular location on the medium was solid or contained a hole.
    Williams tube used a cathode ray tube, and Selectron tube used a large vacuum tube to store information. These primary storage devices were short-lived in the market, since Williams tube was unreliable and Selectron tube was expensive.
    Delay line memory used sound waves in a substance such as mercury to store information. Delay line memory was dynamic volatile, cycle sequential read/write storage, and was used for primary storage.

    Other early methods
    Phase-change memory uses different mechanical phases of phase change material to store information, and reads the information by observing the varying electric resistance of the material. Phase-change memory would be non-volatile, random access read/write storage, and might be used for primary, secondary and off-line storage.
    Holographic storage stores information optically inside crystals or photopolymers. Holographic storage can utilize the whole volume of the storage medium, unlike optical disc storage which is limited to a small number of surface layers. Holographic storage would be non-volatile, sequential access, and either write once or read/write storage. It might be used for secondary and off-line storage.
    Molecular memory stores information in polymers that can store electric charge. Molecular memory might be especially suited for primary storage.

    Other proposed methods

    Memory management

    • Virtual memory
      Physical memory
      Memory allocation

      • Dynamic memory
        Memory leak
        Memory protection
        Flash memory
        Solid state drive
        Dynamic random access memory
        Static random access memory Secondary, tertiary and off-line storage topics

        Storage Decisions
        Storage Networking World
        Storage World Conference

No comments: